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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methiomne, 
N=Asparagine, P=Proline, 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3257 


17158 


A 


3279 


404 


65 


NLVSSPKRSVFFISPSQKIFFFFFNWKE 
SKISLPKEGGGPLLLLIRQRTPIFFFFF 
FYFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFL * TACLF * TFKFTKTK 
R 


3258 


17159 


A 


3280 


226 


383 


G I * KNS MDGCCLGP F WFETGSHSHPG W 
SIVWRNLGSLQLLPPKFKRFSCLS 


3259 


17160 


A 


3281 


91 


26 


LFFFFFFFFFFFFFFFWSERSS 


3260 


17161 


A 


3282 


3 


321 


HTVI Y YFGKNEPFG Y IGVA* AM XSVGS L 
GFMA*AHHIFTGGIDVYSPAYLTCATI I 
VAI PTGVKVFI * LATLLiGS GMKRSAAVL 
* TLGF I FLFTARGLTG I VLSNS 


3261 


17162 


A 


3283 


393 


112 


PPLRVFFPPFPLKNFFFSSRRFFFWGGV 
APFFPPPKKGFFSKIPPGFFFPPPLKKK 
I FFSSP P * FWAP PGFFLKGPPPFFFFFF 
FFFFFFFFF 


3262 


17163 


A 


3284 


360 


47 


PPLHRWVPPPPP*KFFFSPQTKFFGKVF 
SQNPPPPKKSFFPKNPPGVFYSPPKKKK 
KFFLPPPIFFPPPRFFFTTPPPFFFFFF 
FFFFFFFFFFLPLFFWS VS F 


3263 


17164 


A 


3285 


386 


157 


FFSPNPLIFWGGGGPKFPPPKKRFFSKN 
PPGVFFSPPIiKKKKFFFPPP*NLAPPRD 
FLKGPPPFFFFFFFFFFFFF 


3264 


17165 


A 


3286 


3 


385 


DAWVPCKLYLGVFFCRFVKISAILILKI 
LI FNI ENKS I * YLLKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKPTPKKKKK 
KDSR 


3265 


17166 


A 


3288 


81 


270 


I X KLBKKKKKKKKKKKKKKKKKS * KGNE 
KTRGPKKKVHRDVTKKILCYKI CPFNKG 
ISLIGT 


3266 


17167 


A 


3289 


3 


145 


LGRLRQENHLNLGG * GCREPKSHHRILA 
WATEQDS I S KKKRKI GGPV 


3267 


17168 


A 


3290 


404 


47 


GPGGKI FFKKMPEEKI FSTQKKKGFFP P 
FPPKNFFFPPGGFFFWGGGGPIFPPPKK 
GFFPKIPRGVFFTPPKKKKIYFFPPREI 
WAPPGIF*KGPPLFFFFFFFFFFFFFLV 
LNDILLA 


3268 


17169 


A 


3291 


106 


362 


KKKKKKKKKKKKKKKKKKKKKKKRMKNK 
KKKKKGRGGGKKKKPRGAQ I I RGKKKI I 
FFFKKVFK* KGGGDL* KKKS FCGENNLE 
NTHKE 


3269 


17170 


A 


3292 


3 


226 


NPLVNLNLGLLFILATSSLAVYS IL*SG 
GASNSN YAL I GALRAVAQKKKKKKKKKK 
KKKKKKKKKKKKKGGGVL 


3270 


17171 


A 


3293 


337 


2 


F FNRE LL KS KNKE TE KNHD YHKE R I WHF 
DRPYKFYMLFFCFGVLIFFLYLPFFIYF 
FFFFFFFFFFFFFFFFFFFFLNSIiNI*F 
FS INTHKCGNRGQGGGAIGPRLSNGRV 


3271 


17172 


A 


3294 


327 


21 


KFFFSPPNKYFFFVFPHKFPTPTKMFL* 
KIPPPPLISPPPKEWHQGPPPPHIFAPP 
SHPFFGPPHFFFFFFFLFFFLWRTGSRY 
IAQAGLELLGSSYPPAS 


3272 


17173 


A 


3295 


367 


101 


FFFSFF*KKKNFFPPPPLWPPPPNFL*N 
PPPPFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFL VNSLCRRYRG* SLLTY 
YAWL 


3273 


17174 


A 


3296 


178 


2 


KGPPFFFFFFFFFFPDS ITLFI ITKNEK 
KTRCTSIGE* I TWYTHTGE YYS VMKRNE 
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LS 


3274 


17175 


A 


3297 


147 


1 


RGPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFTW*FALTFLV*NF*AI 
ITF 


3275 


17176 


A 


3298 


3 


347 


P FGE GE TNGRFD LDT KKAL VS KKKKKKK 
KKKKKKKKKKKKKKKKKKKRGGALKKKN 
RGGQKKTGEKKKKFFFKKGEKKKTPGNF 
* KKKKFLGGDNLAQT P PKKKNLWGKKKN 
FWG 


3276 


17177 


A 


3299 


405 


2 


WRPKPENSPSPGGPNPPGQKN*TPFPPK 
TP KLGRGCGGAP Q FPLP VRARGENP PNP 
RFLGFN*PKFPFPPFAL*GGGNNETFFQ 
KKKKGPRF WAGVL PTGGAEGE S I PHLS P 
ASGSSRHPWCFMLCRPSRLCkL 


3277 


17178 


A 


3300 


125 


376 


DQPVQNSETSSLQKKKKKKKLiFPGGGGP 
PLFPHFLGGWGGGGTRPPG*RKRGKSGV 
PQPPPPGGRVPPPPHPPKKWGNKGGPPP 
PGKS FFFFFFFWRDEVSLFCTGWSQTQT 
PGLKQSIHLSLPRWLDYGREPPHLAIHF 
FPTRPRTRG 


3278 


17179 


A 


3301 


348 


1 


PQRLKILGGGGPPNSPPPKKGFFPKTPR 
GF*NPPPKGKKKNFPPPRKIGPPQGFF* 
RAPPLFFFFFFGPWFFFFFFFFLFLENG 
VSLYCPGRSRTPGLKLSSRLLASLRAGI 
TGVS 


3279 


17180 


A 


3302 


2 


354 


GRVGGRVGS PTTLTT*NPHSHEKTPSCS 
YPKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKRKRKKKKKKT * GGGAYI KKKG 
GGSPRTEEENQYCFLLGGKKNILRGS CL 
KKTPH 


3280 


17181 


A 


3303 


362 


26 • 


SSQAAHPTKKKKKKKKKKKKKKKKGGAL 
KKNFGGGKNKGGE KKKI FFFKGGGKKNP 
GGNFKKKNFFGGGKIGENPPKKK*RFRK 
KKKFLRGKGGKKS 


3281 


17182 


A 


3304 


371 


3 


NQKKPPPPVFFFFFFKKKKNPPPATPPP 
GRP * FFPPQKKFFPEGGPPPFFKKGAFF 
PPPKKNPFEPFFPGGVFFF*KKKNIKVF 
PPPKKKKKKKKKKKKKKKKKPKKQKKGR 
PARDLEPHAS 


3282 


17183 


A 


3305 


2 


196 


DR*LFSTNHKDIGTIjYLIiFGA*AGVLGT 
ALSLL IRAELGQPGNLLGNDH I YNVIVT 
AHAFVIIF 


3283 


17184 


A 


3306 


4 


339 


LFSTNHKDIGTLYLLFGA*AGVLSTALS 
LL I RAELGQPGNLVGNDH I YNVI ATAHA 
FVIIFFIVRPIIIGGFGN*LNPL J IIGAP 
DRAFPR INNI S F * LLPRS I LLVRASAI A 


3284 


17185 


A 


3307 


2 


367 


KPPAITQYQTPLFV*SGLITGGLLLLFL 
PDLAAGITILLTDRNLNTTLFDPAGGGD 
PILYQHLF*CFGHPEGYILILPGFGI IS 
HIGTYYSGKKEPLGYIGMV*AMRSIGFL 
GFIE*AHHIL 


3285 


17186 


A 


3308 


243 


370 


KGHRPGMVTPVCNPSTLGGQGGWIRRSG 
I *DQPGQQGVTPSL 


3286 


17187 


A 


3309 


2 


363 


LVPLIIQAPDMVFPRINNIGF*LLPPCL 
LLL I AYAI ME AGAGTG* TAYPP LAGNYS 
HPGASVDLTIFSIjHLiAGVSS ILGAINFM 
TTI INI *PPAITQYQTPLFV* SVLITAG 
LLLLCLPG 
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3287 


17188 


A 


3310 


1 


364 


IiY I TGPTLALT I ALLL * TPLP I PNPLVW 
LNLGLL F I LAT S S LAG Y S I L * S GGAS NS 
NYALIGALRAVPQTISYEVTLAIILLST 
LLIRGS FNLSTL I TTQEHL* LLLPS * PL 
AII*FISTL 


3288 


17189 


A 


3311 


215 


2 


GMAKLKARFLPKKGGKRSLSPLLWGSGF 
PTSGTGETFFFFF*DRVLLCQPGWSAVA 
QS * LTTAST F WAQAI 


3289 


17190 


A 


3312 


153 


3 


MNKCI CMCVCMCVCFRSHKHTVCAWVCV 
Y*V* TYTVCVCVCVCVCVCVC 


3290 


17191 


A 


3313 


149 


1 


KTPPGFFPGFKGPF*FKGPPLFFFFFFF 
ETEDGVLLCRPGWSAVSRDRA 


3291 


17192 


A 


3314 


282 


69 


KI G KTAP F F S W P RR VP FLKKKKKKK* GL 
TL S SRLE YGGM I S LELLGS SDTLAS ASR 
VARI IGQCPHAWLT 


3292 


17193 


A 


3315 


479 


60 


MMQHS I KHALVE I TGWVRWLRPVIPALW 
EAEVG E FISTS I S KTNNRKR SN I VKMS I L F 
NL I YR INAP FFKFL I ALLNKNS KSKS I W 
NLKS* *STQ*SSRNNVGGITISDF*THQ 
Q ATE LRQ E FLHTGL VHRS TVS R FRFHA 


3293 


17194 


A 


3316 


129 


3 


S FHQ KVW P DAVAHACS P S TLGG * GGR I T 
GSGDQDHPGQHGE 


3294 


17195 


A 


3317 


3 


156 


EFHRVSQDGLDRLTS*STLLGLPKCWDY 
RCEPPRPASFPSFLTVRISPFL 


3295 


17196 


A 


3318 


36 


340 


TGLVIAEHLLFFFFKKGFWFCPPGGRAG 
PQGNLMEFPVSGGKGIFCPGPPKSGE*R 
APPPSRGKFWFFKKKGGLPFWPGGV*TS 
DPRGTPPPLGPKGGEYR 


3296 


17197 


A 


3319 


326 


2 


KAFFLNFKPHPGFA*GLKFKPNSLLP I * 
SFIRKT*AFLICL* IQLNRADGKIPLKP 
GVI P FKKCFPNFTP P LFFKKNCS S TPNL 
FFFFP FFFFGDG I S LCRPGWS AVA 


3297 


17198 


A 


3320 


175 


312 


QAGVQWCDLSSPXPPPSGLT*FY*CSLP 
RTWEYRCPPSRPCNFSIF 


3298 


17199 


A 


3321 


363 


51 


RWGPRLPVAWLPSVSRRAEECSPGRGAH 
GQGGGGGGQAR*REGPDEWTGGAPVGMG 
S P S LNRGAGRG PPQPGPSS KS RRGRVRC 
TPHSCLIGCGLSFFICKMG 


3299 


17200 


A 


3322 


344 


187 


LRQETCFNPGGKNFG*QKLPPCPPAWAT 
RGWVS KKKKKRKKRKEKIRI CVLT 


3300 


17201 


A 


3323 


3 


313 


TRRERERERERERERERERERERERERE 
REREFFFFFF*KKNLSPPPGGKKRGGGG 
FHSPPFWGEKDFLPPRPPKGGERGGRPP 
PPVFFFFFKKEGAPPGGGV 


3301 


17202 


A 


3324 


365 


1 


SLGWWRVTPPPFPLGPRGGSPPRPDKRG 
SPGRVKFGGVKPFPKVKP*PPPKRLDPS 
GA*FFFSPKKICQGPPPRKKKKKEAELS 
ILCTKFPLQEEEVMQVPPPPCTCSSEPG 
INCICKRHC 


3302 


17203 


A 


3325 


1 


269 


ARALSLSLSLSLSLSLSLEFFFFFLAPT 
Q I LGGGLF YPPKGGAHLSNRLGGGRVF F 
GPPGKKTAPLGGGRVIKKKTPPGTPL*L 
SPPPGGPFFFQGGQKKLSPPPAGLTGGP 
PPWGGKKAPPPKFGSGPKKKKKTLERER 
ERERERERERELV 


3303 


17204 


A 


3326 


1 


316 


ARGERERERERERERERERERERERFFV 
SPPGPDTLQDRGGVSLFASDPTHEIYLP 
CPLRGPLSERDPEELHGVCFFSEPSHRW 
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FPEEGVGDICQC*HRGECGSP 


3304 


17205 


A 


3327 


2 


369 


ARGDVTRESTYQGHHTPPVQKGLRYGI I 
LFITSEVFFFAGIF*AFYHSSLSPTPQL 
RGHWPPTGITPLNTLEVPLLNTSVLLAS 
GV*IT*AHHSLIENNRNQIIRALLITII 
LGLYFTLLQA 


3305 


17206 


A 


3328 


1 


259 


GTRKKHSTILIIREIDIKFMPRFHISPI 
RLVKI*ALANIRC*QQCGKVGNLLHC*F 
S KLVQ S TGRAI WHALVLSLARS RALS LS 
LH 


3306 


17207 


A 


3329 


2 


352 


ARGTLL L L CLL F FALTI Y PRWCD VTRKS 
T YQGHHT P F VQ KE P L VLGGKL F I T S * V I 
FFAGFF*AFYHSSLTPTPLLGGHWPPTS 
I SLFNPLKKPLLNTSGLLASGVS IT*AH 
HSLLQ 


3307 


17208 


A 


3330 


3 


361 


HEE P LG Y I RMA* AM I S I G FLGL I VRAHH 
I FTGGIDVDSRTYLTSATI 1 1 AMPTGGK 
VFR*LATLHGSNMK*SGAGL*ALGFMFL 
FTVGGLTGIVLTNS * LDIGLHDTYYWA 
HFHYVLS 


3308 


17209 


A 


3331 


1 


340 


GTSGDTRAYFTSAT 1 1 IAI PTGVQVFS * 
LATLHGSNMK * S AAE L * ALR F I F L FT VS 
GLTGIVLANSSLDIELHDTYYWAHFHY 
VLS IGAEFAI IRGS IH* FPLL* GYTLDQ 
T 


3309 


17210 


A 


3332 


2 


352 


ARGD VDTRAYFTSGTI I IAI PTG VKGFS 
* LATLHGSNMK* SAAVL* ALGF I FLFT V 
RGLTG I ALANS S LD I VLHDT YYGGAHFH 
YVLSIGAVFAIIGGFIH*FPLFSGYTLD 
QTYAK 


3310 


17211 


A 


3333 


85 


370 


QVSHRVRPCFERERERDRYRHTQRECER 
AKEIYSGFFLSASEMESCSLAQAGGCKE 
LRMCHCTPAWVTQGD C VS *NKK* HWEKK 
GLWFLGVNIST 


3311 


17212 


A 


3334 


362 


3 


QEVKVSMNRDHTHSLQRGEQKRNSVSKK 
KKKS S QE KLKRVHTQTHT * I F I AVL FI V 
AKNGKQFKFPSTCEWINNM*YIHTMEYY 
SATKNQLIHVTT*MNLNNIYTKFLKARQ 
KMITYSC 


3312 


17213 


A 


3335 


3 


351 


HEGTTLHHLFLLETGSNHPLGITFHSDK 
KTFHP Y YT I KDALGLLLFLLS LMTLTLL 
*PDLLGDPDNYTLAHPLNTPAHIKPE*Y 
FLFAYTILRSGPNKLGGGPALLLSILXL 
AIIP 


3313 


17214 


A 


3336 


17 


350 


FIFTLTETNRTPFDLAEGESELFSGLKI 
KYAAGPLALLFITEYTNIIIIDTLSTTI 
VLGTTYDALS PELYATYFVTKTLLVTSL 
FL* IRTAYPRLRYEQLIHLL*KNFLPL 


3314 


17215 


A 


3337 


2 


349 


ARDSL I S P I LRLHPRPNLRQNAFHYHIH 
RGKSNFLPTTLSRPIRNAPTLLGLPRCI 
HHMKHP 1 1 CRLIH* FPLFSGYTLDQTYA 
KMHFTI IFIGVNLTFFPQHFLGLSGMPR 
R YS D YPDAY TT * N I L S YVGS F I S LTAAI 
L 1 1 FMM * E AFAS KRKVL I VEE P S I NLE * 
LYGCPPPYHTFE 


3315 


17216 


A 


3338 


2 


386 


ARAILNAMAFLILTERKILGYIQLHRGP 
NWGPYGLLQPFADAIKLFTKEPLKPAT 
STITLYISGPTLALNIGLLL*TPLPIPN 
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P LRNLNLGLL F 1 1±AT S S LAD YS I L * S G * 
ASNS N YAL I GALRAW 


3316 


17217 


A 


3339 


175 


16 


ILGDLFPPAWLHPFLFLLLLFSRPSLAV 
T E AAVQ WRNLGLLQ PLPPG Y * S S WC 


3317 


17218 


A 


3340 


1 


369 


GTSNLSLLF IVATCGLAAN* ML * *GGKI 
N* I YTLRVALAAI IHTILYE I TLS I ILL 
*TLLISGSFNLSTLITTQEHL*LLLPS* 
PLAI I * F I STLAETNRTPFDLAEGESEL 
F * GLN I E YS AR 


3318 


17219 


A 


3341 


1 


363 


GTRGGT I L P AM VLML I D VP S LR I L Y I TN 
KVNE P S L T I KS NGHQW Y * T YQ Y TD YGGL 
I LNS Y I LP PLFLQ PGDLRI LDVDNRWIi 
PIETPIRIINT*QDGLHS*AGPTLGLKT 
DAIAGRVNQ 


3319 


17220 


A 


3342 


181 


315 


PKSPPTPPPPTXXXXXPGGNPPGPTL*G 
P F YWGGERGPP F * TGGP 


3320 


17221 


A 


3343 


3 


183 


HEVSQSCRELLTSGDPPASASQSAGISG 
MHHRAWPVS *TFHI FNIYCGPGTVLKP Y 
LNL 


3321 


17222 


A 


3344 


83 


354 


VLGARCPCAGVSRVSMEPWFLWQQACLL 
P WWR C CLS TL YAMRADRD VP * TSRTGP V 
MVAHACS PSTLGGQGGWVTRSGV* DQPG 
QDGETP 


3322 


17223 


A 


3345 


124 


379 


GQGI CFMVVSLGLSHRLAHCRYTVGVNN 
LLCSSSLPFLLPLLPLPPPEGPGWDDIN 
1 1 IF 1 1 FRDKVLLCCLGWSAVT* SQLTT 
A 


3323 


17224 


A 


3346 


3 


368 


YEP I P * PLTGALS ALLMTCGLAM* IHLH 
S I TLL I LGLLSNTLT I YQ* WRD VTRES T 
YHGHHTPPVQKGLRNGIILLITSEAFFF 
AGVF*AIYHSSLAPTPQLGGHWPPTGMT 
PLNPVQGPLL 


3324 


17225 


A 


3347 


1 


352 


GTSAGDVNYG-* I IRYLHADGAKI FLRCR 
FLHSGRGLYYGSYVYSKT*NIAINLLIA 
T I ATAF I G YVLA* GQ I LF * GATVSTNLL 
SDIPYIGTDLSQRI *GGYSVDRPTLTRF 
FTFHF 


3325 


17226 


A 


3348 


2 


360 


ARAE ISP LHS NLGNE S E TP S QKKKKKKP 
PGGGGKNWNPYPSGTPPPNPPKGGE*RE 
EPPPLTKRVF*KKKVFFEGPRVKKNLGG 
KGKKKRGFPAR* KNP VFKAKGKGRNFGP 
QAFFFKK 


3326 


17227 


A 


3349 


226 


366 


PQSFCSVASWPAGDLMEI*AWQKFEYKP 
GKVPMDPEGCLLPLAKKKEEEEEEEEEE 
EET^EEEEE^»EEET*!E'FiEE^)F!FFiG'F!F.'R'F'IT i\7" 


3327 


17228 


A 


3350 


2 


374 


ARGGGYS PHRATLTRTVTFHS I LP F 1 1 A 
GLAAIHLLFLHETGAYTPL* ITSHSEKT 
TFHPYYTMKDALGVLLFPLSLTTLTLCS 
PNDLGEPHDYTLTNPLNTRPHIKPE *DC 
LFAYTNMRSGPN 


3328 


17229 


A 


3351 


1 


266 


LGLVEHFLTLTTKLTSSNISDAQEIQSV 
*NILTAI ILNLMVLPSLRMLYITNEVFD 
PSLTIKSIGHQWY*AYEYTDFGGLIFNC 
YILP 


3329 


17230 


A 


3352 


207 


362 


ILFLLKQKIYVLFGFSN*GSSIPELAHS 
DAYQTKE I CSSGKVYKYLQCWEKW 


3330 


17231 


A 


3353 


178 


365 


NHGIDKGLRYRI YTVQQEDNS VKR* LKD 
FNQHL S KDN I RS AD KHMKRCP TS L VI N I 
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nucleotide deletion, \~ possible 
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MPIKPP 


3331 


17232 


A 


3354 


359 


3 


LQVRSLMPLGQFHPP*NFFCSFFPKPKI 
P FFFLFFLKGVF VFFPGWRARGQFWVFG 
PPPPKFKKFFFFTLLR1SIWGFRDFPPSRG 
KFFFFFFFWYF* * RLRFTMLTRLVFNS * 
PQVISC 


3332 


17233 


A 


3355 


2 


354 


ARE P S P * PLTGALS AIlLMTCGLSM * IHL 
HSITIiLILGLLTNTLTIYQ*WRDVTPES 
TFQGHHTP PVQKGLGYGI ILFITSEVFF 
FAGFL * AF YHS SIjAPTPHLGGDWPPTGV 
TPLNAL 


3333 


17234 


A 


3356 


1 


364 


GTREM I CCS ALS PRIHLS FHRRWPDWHC 
ISKLITRHRTTRHVLRWSSLPLCPINRS 
WICHHRRLHSLISPILRLHPKPNLRQNP 
FHYH IHGRK* SAAVL * ALGFI FLFTVGG 
LTG IVLiANS S LD I GLHDT Y YVGAHFH YV 
LS IGAGFAI IGGFIH* FPLFSGYTLNQT 
YAKIHFTIIFMGVNLTFFPQHFLGLSGM 
PRRYSDYPDAYTT 


3334 


17235 


A 


3357 


2 


376 


ARGGQPEDYILILPSFGIIPHIVTYYSG 
KKEPFGYICMV*AMISIGFLGFIA*AHP 
IFSVGIDVNTRAYFTSATI I IAIPTGVK 
VFS WLATLHGSETMK* SAAAL * ALRF I FL 
FTVSGLTGIVLCN 


3335 


17236 


A 


3358 


357 


139 


AQPWFFFLETVFCHAAQAGLKSIGSSDL 
PTSASQSVGTTGMSHHAWPERSC*QNTF 
TYYVQSIYSDDIRKSMF 


3336 


17237 


A 


3359 


277 


1 


S CQKD I S IGFLF VPGFRVKGVAQEHL VG 
VRMYVLVPTVGGDGKERER*RGIRHTPP 
P S ER VRQRE RERGRDRDRE KE RAS QRKP 
EMDRETSC 


3337 


17238 


A 


3360 


261 


2 


TICS KCAWL C F S KTLLTNT CAS RSD LAH 
GP * CPNLCSGIFS SAWTD I SRIiLLFFCL 
FVLRQSLTVAQAGVQWCNLCSPQPLPPG 
SC 


3338 


17239 


A 


3361 


199 


354 


HLPYWPMFMATFNFFFFFETEFHFFPQA 
GGQWGD FG* LKP P PARLKQFS CLN 


3339 


17240 


A 


3362 


3 


364 


HEDRDTPAYFTCAMI I I AI PTGVKVFN * 
LATLHGSNMK* SAAW*ALRYI FLFTVS 
GLTGIVLSYSSLDIVLHDTDYWAHFHY 
VLS I RAVFAI IGGFIH* FPLFSGCTLNQ 
TYAEIHFT 


3340 


17241 


A 


3363 


3 


360 


HEETYIPKDQKYSFLHDSQTSFCFSNSI 
PTPSNMEETQQKS VSGCRLP * AGMGETC 
GQGSRAAQPLPSLPLQNLKLLRISLLLI 
EAWLEPERVLKSMFANNLGYDTSDSDDY 
HLLKDLE 


3341 


17242 


A 


3364 


352 


27 


VYLS S QRNS AAS VNRMT WTDRS LSPYT 
LGWHRPDRLFHFVFSLFVYTFVERGSQS 
IAQAGVQWCDKSSL*P*TPGLK**TCSC 
LP * VAGTTGTHHYTGPL I LLNL FN 


3342 


17243 


A 


3365 


347 


99 


GGPPIiPRGFSPPPGARPLAPPWGPPQES 
PPPPPPKTAPQKKKKAPS IPAGQWLVEQ 
NPRPQPFKKPPGVGF*NPPPPLNMGF 


3343 


17244 


A 


3366 


3 


319 


HEECIAHTLCFFNLMS*RTTVILHRMFS 
IVTAI * YLLSS Q YVI G Y Y I YVC I HTHTH 
MYIHTHRHRTFICVCVCVCIYIYTSIC 


3344 


17245 


A 


3367 


3 


474 


YELLLLNGPILNARAFLILTERKILGYI 
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QLHKGPNAVSPYGLLQPFADAIKLFTKE 
PL KP AT S T I TL Y I TAP T LAL T I DLLL * T 
RLPIPNPLDNLNLGLLFILATSSLPVYS 
III* SG*A*NSNYALIGALRAVAQTI * *E 
VTPT I I LLSHYYKWLRN 


3345 


17246 


A 


3368 


2 


348 


ARGLQDATFP I JEELITFHDHALII I VL 
ICFLGLYALFLTLTTKLTNTNISHAQEI 
ETV*TILPAIILVLIALPSLRMLYITDE 
VNDP S LT I KS I GHQ WY* TYQ YTDYGGL I 
FNS 


3346 


17247 


A 


3369 


64 


363 


KKWGFVPQQKKGGGRISVMGTPIFKGKG 
NFPP*PPGEGGTKGPGPKPK* IWIFKKR 
GGLIFWPKQFPTPGPKGTHPPGPPKGGK 
*RGGPPSPGGFNFKRG 


3347 


17248 


A 


3370 


34 


355 


AKGKE Y I E T VAE KKKKKKKAP P P I * KAP 
*NPGKKPPLALKFPPPSQKAPPPALFCP 
* KGP P PRVF P KKKKKF P P FGRKGFFFTG 
GKPP 


3348 


17249 


A 


3371 


1 


374 


FLHSLHLSLRLLYLP*KFSMFIILLKAY 
YTLCVCVCVCVCVCVCV* PLNKIHINHF 
TCSTVRGRSNPIFFPPLLN*RPFYTT 


3349 


17250 


A 


3372 


429 


3 


FFFFLKKQKISRVFPRGPEFFGQRKPFW 
AKGPPRAPQKGGEKGKNPSPPPPLIFFK 
KGEKPQGGPLFKKGLGFFSPPKVFFKKG 
VFKSGPGGP * NFGFRGFS P PS P PKKGGK 
KRKPQSPGFFFFFFFFEILGCSGRSRSR 
T 


3350 


17251 


A 


3373 


2 


404 


LGTDLSLI IRADLGQTGDLLGNDHI YNA 
XVTAHAFEIIFFIEIPIIIRGFGlSr*LIP 
LI I GAPDMAF PRINNI S F * LLP PS I LLL 
LASAIVEAGAGTSGTDYPPLSGNYSHPG 
ACVDLT ILSLHLAGVSS ILGAI 


3352 


17252 


A 


3374 


2 


417 


AFV* I AH I TRDVT YG * 1 1 R YLHAKGAR I 
FFMCLFLHIGRGLYYGTFLYSET*NIGI 
XLLLAT I ATAF I G Y VLL * GQ I S F * GATE 
ITNLLSAIPYIGTDLIQ*I*GGYSVDSP 
TLTRFFTFHFILPFIIAGPNPLHLTT 


3352 


17253 


A 


3375 


85 


1 


FWPGAVAYAC YP S TLGG * GGW IMRSGVR 


3353 


17254 


A 


3376 


396 


1 


KEGYI KGPRGENFF* KNPERKI FSRQRK 
GGVFSPLPPKKFFFFPKGLIFLGGGCPI 
FP P P KKR VFS KNP PGVF INP P * KKKKI F 
SPPPENWGPPRVFLKGPPPFFFFFFFFF 
F F F F F QTL FLGGCG YNTKLR 


3354 


17255 


A 


3377 


3 


118 


LL P P S LLL LLAYA I L VAGAGTG * TD YP P 
LTGNY*KTKA 


3355 


17256 


A 


3378 


301 


82 


ERKGPPQRGGKKGPKAQKPEKKKPKKRP 
PPPQN* FFSNPPKKKKGGKDPPSKKRGG 


3356 


17257 


A 


3379 


2 


195 


DR*LFSTNHKDIGTLYLLFGA*AGVLST 
GLSLLIRAELGQPGNLLCNDHIYNVIVT 
AHAFVIIF 


3357 


17258 


A 


3380 


21 


411 


VFCTNH*DIGTLYLLFGA*AGVLRSALS 
LLIRAELGQPGLLLGNDHIYNVIVTGHA 
FVMNLFIVIPIIVGGSGN*LGPLEIGAP 
DMAYPRINNIGF*LLPPSLLLLLAYAIV 
EAGAGTG* T VYP PLAGNY 


3358 


17259 


A 


3381 


2 


421 


GRVGGRVGSTNHRD I GTLYLLFGA* TGV 
LGTALSLL I RAELGQ P GNLLGNDHMYNV 
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I VTAHALAKI FFILVPI I IGGFGNGLVP 
LIIGAPDMAFPRISNISL*LLPPSVLLL 
LESVIAEAEAGTGRTVYPPLAGTYSNPG 


3359 


17260 


A 


3382 


3 


185 


VIYSTIFAGTLITALSSH*FFT*VGLEI 
NMIAFIPVLTKKINPRSTEAAIKYFLTQ 
ATAS I ILL I AILFNNI LSGQ *TI TNTTN 
QYSSLI I IMAIAIKLGIAPFHF* VPEVT 
QGTPLTSGLLLLT*QKLPSSISSRKQPH 
P 


3360 


17261 


A 


3383 


5 


398 


FFGHPEVYILILPGFGIISHIGAYYSGK 
KEPFGYLGMA*AMISIGFLGFIA*AHHI 
FT VG IDVHTRAYFTS ATI I IAI PTGVKV 
FN*LATLHGSNMK*SAAAL*ALGFIFLF 
TVGGLTGIVLANSSLDIVL 


3361 


17262 


A 


3384 


1 


250 


LGNTGETLSLQKNV* KLARCGGTCLRSQ 
LL,RRLK*EHCLTPGGRGCSEPRSCHSSP 
AWATGQDLSQQQQQKYPLRIFSAIIKS 


3362 


17263 


A 


3385 


456 


0 


I FFF S QVE TGIHHI GQAGLQLLTSGDAP 
CVGLQKSWDYRPEPLHLAWLIHSLLSNF 
YYC*FGFCKLSVFILINVFCNKTCNSSM 
FLFSYS * NP CAQ * KNEG PL LNKWP CR I T 
DAWADAWVAPVF 


3363 


17264 


A 


3386 


1 


225 


PSRNFLVL VDKL ILKFI* K YRGPRTAKT 
T L KKKKKWRLTLL I FKS Y Y KT I V I T I G 
WYGFQDRQVD*WNRIESLE 


3364 


17265 


A 


3387 


3 


400 


LLI LGLLTNTLF I YP * WGDATRES TYQG 
HHTPPVQKGLRYG J ILFITSQVFFFAGF 
F*AFYHSSIiSPTPQLGGHWPPTGITPLN 
PlaEAPLLNTS VLLiASGVS IT* AHPGL I E 
NNPNP I FRALLI TNI LGLYF 


3365 


17266 


A 


3388 


24 


407 


IASGRPFFFFFFFPKGKFFISRGGGPFS 
P PKI * KNWGKGKS F FP PPQREPKKRGVP 
KNPSLSFYPFGGKPPLGPPPKLVPPSS* 
PGKKAFFFF*TPPPGFLTPLGGGLSPLK 
GGYLLAPHFVPTEGPIi 


3366 


17267 


A 


3389 


148 


413 


LNLNNNHWG KKHL LNT YS CT * LE E KKKK 
KKKKKKKKKKKKKKKKKKKAPGGGFF 


3367 


17268 


A 


3390 


157 


14 


RVS ACCQVWW* MP WPATREAEARESLE 
PGRHRLRSCHCDRDPVSKS 


3368 


17269 


A 


3391 


2 


398 


LFSTNHKDIGTLYLLFGARAGVRGTALS 
LLIRAELGQPGNLLGNDHIY1WIVTAHA 
FVINFFIVIPIIIGGFGN*LVPLIIGAP 
DMAFPRINNI SF * LLPPSLLLLVAYAI V 
EAGAGTG * TVYPPLAGN YSH 


3369 


17270 


A 


3392 


2 


395 


DQKDIGTLYLLFGA*AGVLGTAVS:LLIG 
AELGHPGYL IGNDHVYNVIVTAHAFVI I 
FFIVIPIIIGGFGN*LVPLIIGAPDMAF 
PRINNISW*LLPPSLLLLLACAIVEAGA 
RTG*TVYPPLAGNYSHPGA 


3370 


17271 


A 


3393 


396 


54 


GYPPNLTFVTSSLTSVPSPSHPPSHVCS 
PTPAIPQTPRSPPSESTPPTEVSLTTPT 
LSLGAI S S LAS AGPQGAS PAS PGFWGTS 
PP*PYS*NRPQTAACGVFSQLFFIRPEV 
P 


3371 


17272 


A 


3394 


417 


60 


KGWVFQRILWPGPGGSPRYSRPFGGPRG 
L I PLGSGV* TP PGPHWETPFFFNNQP I S 
GQGGGGPLIPLLGGVKPEKIFYPGNQSF 
H* PKIPPHPPPLGAKQNFPPKKKKPQKN 
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TKGFWV 


3372 


17273 


A 


3395 


417 


3 


FFFFFFFQHHFGFE AAA* YWHFVDW* L 
FLYVS I Y* * GS YS FS INS 


3373 


17274 


A 


3396 


229 


1 


RS F WTKTMGFS C YKDFF * TNGNKDTT YQ 
NLWDTAKAVLRGKF I ALNVP INKLERS Q 
I SNL I S Q VKEGQAWWLMP VI 


3374 


17275 


A 


3397 


344 


165 


ELKS P Q P RKRNG C F FLAVD P DEGF * NLN 
FCGNGQGLNAPGGYPCE I PMGFVPNAYG 
KTF 


3375 


17276 


A 


3398 


1 


409 


HKDIGTLYLLFCARTGILGTALSLLIRA 
ELGQPGNLLGNDHIYNGIGTTQALVRIL 
FIVIPIITGGFGN*LIPLIIGAPNMAVP 
R INNI S F * L * P P S LLLLVAYG I VEAGAR 
TG * TG YP P LAGN Y S HPGACVD LTC4 


3376 


17277 


A 


3399 


3 


396 


KDIGTLYLSFLS * AGVLGAALRLLIRAK 
LGQPRNLLRNDHIYNVIVTAHAFVI IFF 
IVIPI I IGGFGN* LLTLI IGAPDMALPR 
I NN I S F * LLP P S LLLLLAYG I VE AGART 
G * TAY P P LSGN Y S H PGAS V 


3377 


17278 


A 


3400 


49 


350 


KSNSHTE VQAGQEQ KS F FRS KHKAGRAG 
SDPAPYVLAGCLCRSISSFLFFFFSLFF 
FFFFKRDGV* QGYSSGSNTPGVKQS S CL 
GLPKYWDYRREPLHPA 


3378 


17279 


A 


3401 


1 


398 


KPPAITQYQTPLFV*SVLITAVLLVLSL 
PGLGAGI TI LLTDR S LNATF FD PAGGGD 
PILYQHLF*FFGHPEAYILILPGFGIIS 
HIETYYSGKKEPFGYIGMG*AMISIGLL 
GFIA*AHHIFTG* IDVDTRS 


3379 


17280 


A 


3402 


388 


42 


T P L FN P PGAKC VN P F FP KKLKKKNP S RA 
MGGPPLFPPLWGGKPKGFP*NRKSGAPV 
THFAP PAP PPGGKREPPP PKKKKKKKRK 
EEKE SNRAVQ I KKRGNKKGRLP VLD PAE 
ESQ 


3380 


17281 


A 


3403 


413 


2 


SPHPLKNFFLPPKPPNSGGGLAPFVPPQ 
K*GSLPKIPRGRYKSPPFKEKTNALPPR 
GKPGPPRASSKRPPLFFFFFRGRVPLCH 
PG* S TVQCVQAQCTLELLGS SNFLTSS S 
PVTWGHRQGPPRP INFFI FFRGRVP 


3381 


17282 


A 


3404 


433 


274 


LGWAHACNP STLEG * GGRTAWS PGVQD 
QPGQHSETPISTNFFWYFRISYFHI 


3382 


17283 


A 


3405 


131 


427 


GPPPQPFFFWGKRGGPNPFFFFFSEKRD 
VF*QFFLRRRGPQKFFKPPFFCLQKTSL 
FSEKKKKKGFGPPLFPQKKKGCGGGPHY 
KKEGFTLNSFLFQGDGVSSVAQAEVQWR 
NISSLQALPPGFT 


3383 


17284 


A 


3406 


413 


137 


SPPPPPQGGIFFFFKPR*ETFSPPPNLG 
FFSPPSPLKFFFFPKGFIFLGGGGPKFP 
PPKKRFFFQNSPGGFFFPPF*KKNFFFL 
PPVILGPPRVFF*GPPPPKK 


3384 


17285 


A 


3407 


319 


540 


IVFKDQCEESFTTLNVDIRNHQNLLDSL 
E QHGKGDLL *GANAYHCE KCNKKVRAVQ 
LCLILITFNGFRKNRFFW 


3385 


17286 


A 


3408 


2 


403 


SLHLGGGSCIVGAI YFITTI INIKPAGI 
TQYQTPLFV*SGLITGILLLLSLPVLAA 
GITILLTDRNLFTTFLNPGGGGDPILYQ 
HLL*LLSRPEVYILILPGFGI ISHIGTY 
YSGKKEAFVYIGVV* AIIS IGC 


3386 


17287 


A 


3409 


2 


142 


KKENYRPVSLNRDAKSLNQ IQLHI * KI I 
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HCNQ VG F I PGMQGW FN 1 3j 


3387 


17288 


A 


3410 


407 


48 


PPGGKI FF YKNPE KKI FPTQQKKGVFPP 
FPLKNFFFPPRGKIFGGGGGQNAPPPKK 
GFFPKIPQGVF*SPPKKKKKNFFPPGKI 
WAPQGVFLKGPPPFFFFFFFFFFFFFFF 
FFFLGVR 


3388 


17289 


A 


3411 


3 


402 


S YRLE * QLEDKAAPPQDFTQQLRWPEKQ 
HRMVWPDKEDMHKQLVEASETLKSQAKE 
L*DAHQQQKIALQEFLEFNELMAELYSH 
NHKVWDKEEEMEVAMHKADMMWQEI*RS 
KKLTKRMLFTQMRRITPVIPS 


3389 


17290 


A 


3412 


1 


224 


LLFQLLRRLRQENHIaNLEGRSCSELKMH 
HWTP PWGTE * DCL * LKKKQ S AYLPNNT I 
KKLTG S RT I ED I CTE YNK 


3390 


17291 


A 


3413 


202 


1 


IVMSWPGWTHA* IPSTLGGQGGWITRS 
GVRDPLASAFQSAGITSVSHCSQLIFVT 
VFYLS PLSHLY 


3391 


17292 


A 


3414 


407 


309 


P SFKAS KDRLTLLLGANAAGDLKFTPMF 
I YHS ENPRALKNYTKSTLPVP VLCIRNI 
KAWMIAHLFTAWLNEHFKPIVETHCSGK 
KISFKI LLLi INKAPGHRRAQMERHKMNV 
VFMLANTTP ILKLMDQGVF * TFI LCLS I 


3392 


17293 


A 


3415 


404 


46 


LKKP P FQKFLKI LKKKKKGGGGGPPP * T 
PPFGGGRGGVPPRAGVSKNPVPQGETLF 
FFKNPKFPGVGGPAP*FPLLRGVNKK3STF 
FNLGGGGFKKRKFSPPPPPRGKKFFFPK 
KKKKKKE 


3393 


17294 


A 


3416 


23 


404 


S VL I TGALDLL S M P VLGAG I T I LLTDRN 
LKTTFFDPAGGGDP ILYQHLL* I FGHPE 
DNILILPGFGIGSHIVTYYSGKKEPFGY 
I GMA* AMKP IGFLGD I AGAHHVFTG * I D 
VNTRSYFTSDTINMG 


3394 


17295 


A 


3417 


38 


469 


SGTTHASAI.FDPAGG*DFILYQHLF*IF 
GDPEVYILILPGFRI ISHI VTDYSGKKE 
PFGYIGMA*AMRS I G FLGF IG * AHH I FT 
V* IDVDTRAYFTSATI IIAIPTGVKVFT 
* LATLHGSNMK* SAAVL* ALRF I FLFTV 
GGLT 


3395 


17296 


A 


3418 


1 


419 


FSTNHKDIGTLYLLFGA*AGVLGTGLSL 
LIRAELGQPGNLLGNDHIYNGIGTAHAF 
VriFFIVIPIIIGGFGN*LDPLIIGAPD 
MAFPRINNISF*LLPPSLLLLLASAIGE 
AGAGTG* TVYPPLGGNYSHPGASVDLT 


3396 


17297 


A 


3419 


397 


107 


KNSAPPFSPIFLP*GPKFAFFPPGKKVG 
KKIF * KFPAPGPQGVYIjREGGGS PSPPQ 
TPNWPNKKAAFVLPPPPFFVFGNPKKKK 
KKAVAFSSPNRR 


3397 


17298 


A 


3420 


3 


253 


RGCREMRLCHCSSAWAIAGVSEEKKKKK 
KNWQ*CLLGICSYLDLWLPGLFKWLSQI 
GRIVELQLLELKQYLRGITKQTLKILH 


3398 


17299 


A 


3421 


1 


393 


RTRTIKTPAITQYQTPLFVGCALITAAL 
liLLSLPVIiAAGITILLTNRYLNTTFFDP 
VGGGDPILYQHLF*IiFGYPGDYIVILPG 
FG 1 1 S HMVAY YS GKKE P FG Y I RMD * AM I 
S IDFLRI IV*AHHI FTVEN 


3399 


17300 


A 


3422 


142 


377 


LSKNCYKPSYVGFVCQCKFFNQLSKFLE 
VKDLYCMVKDC**EG*MAQPL*KQFVLV 
AIjKLNIVLPYNKAISLPGIYSTD*KTYT 



758 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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DTQNLHTTVYSSSLIIIKN*KQQRCPSV 
GE* INKCFI YTTEYYSLTKRNKLSSHMK 
A 


3400 


17301 


A 


3423 


2 


435 


LILPGFGIISHIVTYYSGKKEPFGYIGM 
G*AMISIGFLGFIV*AHHIFTVGIDVYT 
RAYFTS AT IIIAI PTGVKVFR * LATLHG 
SNMK* SAAVL * ALRF IFLFTVGGLTGI V 
LANS *VDIVLHDTYYVGAKLHYVLS IRA 
VFAI 


3401 


17302 


A 


3424 


2 


436 


LNTTFFDPAGGGDPILYQHLF*FFGHPE 
VYILILPGFGI ISHI VTYYSGKKEPFGY 
IGMG*AMISIGFLGFIV*AHHIFTVGID 
VDTRAYFTSATI I IAIPTGVKVFS*LAT 
LHGSNMK* S AAVL * ALRF I FLF TVGGLT 
GIVLS 


3402 


17303 


A 


3425 


1 


193 


PTRPPTRPACSELRSRHCTPAWVTE *DS 
VS KKKKKKKF S P ARGAH VC T P T F LGGQR 
GKKFCFAP 


3403 


17304 


A 


3426 


3 


446 


HKKFLFPKSFFNRVFPPTKKKNPRGPGF 
FFFFLKKIFFFPPGGI*SGGFGSLQLFP 
PRVKKFFSPPPPEKGGFKAPPPGPGNFF 
FFLKKRGFSFFGGGFLK*NPGFPPPPPF 
KKWGVPKGGAPPPPPFF 


3404 


17305 


A 


3427 


202 


1 


FS PGRMLGNT I KL P YTHS F I LMNS QNNL 
EKYKVGGI PLSNFKTNYKATAI KTMWYW 
HMNR YMD * WNR 


3405 


17306 


A 


3428 


159 


1 


NPPHE KNFCS FFQME S HSADQ AGVQ WCD 
LSSLQPSLRGFKRFS*VSLVSSWDH 


3406 


17307 


A 


3429 


22 


298 


ESYASTAKATERDMSDRMLSASGMPRRY 
SDYPDAYTT*NILS S VGS F I SLAEGILI 
I FM I * EAFAS KRKVL IVEEPSINLE*LY 
GCPRLRGR 


3407 


17308 


A 


3430 


267 


26 


GITFFFFFWLEI *LYYWLFLANKLYFHE 
QIWFGLVFFFGLLLFDFSFYSDYFPLLL 
CLGSFSNF*CLIHCFFSVIIVLLF 


3408 


17309 


A 


3431 


274 


376 


NKG * I S CS I S P PQHT VRRGLTLS PRLE C 
SGVISAHCRLCLPGSSDPPASAS 


3409 


17310 


A 


3432 


177 


47 


FYFFFFFFFFFFFFFFFSLLFFFFFFFF 
FFFFFFFFFFFFQINRSFIASFKYHK*V 
LIFKKKKN 


3410 


17311 


A 


3433 


379 


83 


F KRH FTKEDLRMVTEH I KS CS S S LATRA 
MQIETTMAHCYTFMRLAKIKKYQYTKCC 
*R*RVTGTLTYCWWECKMTQPFWKTIVR 
RFLKQY I YHVIGYV 


3411 


17312 


A 


3434 


214 


3 


AASTFFFFTRSTYYVCIFLKSKRMINK* 
NYIKLKSFCTTTTKINKMKRQPTDWEKM 
FANHINHHEGLILK 


3412 


17313 


A 


3435 


1 


353 


GGQGQPGQ* SETLSQ* KVSWAW* LQLHR 
RMRQDNLL S P GD QG * S E P C * C Y YT P AWA 
RVRLCQNRS TOEKSLHDI CNDKK* QNI * 
I I SVPEGKQRMKGLENLFNE ILDENYPS 
LARDL 


3413 


17314 


A 


3436 


29 


394 


VFMEVDFFFFPFFFFFFFGKRNFFFPPR 
GGGGGPF* I FGPPPPRGKGNFPPPPPGK 
GG KKGPPPPPGFFFFF*KKGGSPPGGGG 
FLTPPLWDPPPWPPQRGGFGV*TPWPGP 
GSDFLNFFFF 


3414 


17315 


A 


3437 


108 


2 


P VAVAHS CNPNTS GGRGG W I S *GHEFQT 
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SLANT VK 


3415 


17316 


A 


3438 


215 


3 


LLLLREFFLPKHREVFIFVFYFMRQGLT 
LTEAGVQWRSHGSLWPQTPGLK* SSCLS 
LPSIWEYSAYHHIR 


3416 


17317 


A 


3439 


1 


186 


QTS S F S FRRS V I TN F FKL KDD V * TH * KE 
AKNLEKRLDEWLIRIKSVEKTLNDLME1) 
KTSTRP 


3417 


17318 


A 


3440 


290 


3 


PGLGGGPRYSRFSGGLGPKI PLNPEGEG 
SIKLNPPPAPPPGGPN*DSFLPPPPKKK 
KKKTHVLFAGSGNLLQPLQPGAFTTEVN 
GGLAQQSPKVL 


3418 


17319 


A 


3441 


163 


2 


PAGLPVKPPRFQDKLNFPFGPFFFFFFF 
* I*DKVLLCHPGESATVHSSLGDRA 


3419 


17320 


A 


3442 


131 


1 


RPRRP LTIKSI GHQ W Y* T YE YTD YGGL I 
FNSYILPPLFLEPGS 


3420 


17321 


A 


3443 


162 


2 


PGAAKSTGHLPLRFHLRLGAVTHAGNPR 
TLGGQGRW I T * GQE FKTSLAKRVRP 


3421 


17322 


A 


3444 


2 


130 


WEKIFANHVSDKGLIYRIYNELQQLYK* 
KTNGP1 KKRPKS TRP 


3422 


17323 


A 


3445 


2 


117 


LALLGLKPGLK*ATCLGLPRYWDYRHQP 
PHPACFSKGI 


3423 


17324 


A 


3446 


350 


101 


KAFSS S LGS SRKKGD S DAGF C I HFKT VP 
FGFFWTLELKKGIPFFLNFFFFPFLRVL 
KGSKNPF*KRGPFFFFFFFFDNFYIWN 


3424 


17325 


A 


3447 


106 


1 


LLQPNGQLGPGTVAHGCNPSTLGG*GER 
ITTSGVR 


3425 


17326 


A 


3448 


28 


333 


GQQEQNS ISXKKKKKKKKKTPFKKGKRK 
IPGGKKKPLGKSRAPPPLGGGEKNPPPG 
KRAPEKI FKKKPRGNWETFKRMEKGFLK 
GQ* PGNP * KVKPLWGGKK 


3426 


17327 


A 


3449 


81 


339 


P I VS S C P GVAGAL KQAMTLE F KVYQHHV 
VANCRALSEALTELGYKIGTGRDTDGVQ 
QALLWGLRLAS * P I VSSCPGVAGALKQ 
AMTLEFKVYQHHWANCRALSEALTELG 
YKIGTGRDTDGVQQALLWGLRLASQFV 
QPGCGPGSAAAAAMM 


3427 


17328 


A 


3450 


166 


322 


RDFSTTVLITDRTSRQKIPTDVSQLNLI 
DI YKTL* PTIGKYTFFSSTYRSLR 


3428 


17329 


A 


3451 


257 


354 


KFLSFFRDRVLLCCPGWSAW*S*FTAA 
LSSW 


3429 


17330 


A 


3452 


286 


3 


TDPFPI KRAKEREMLQ ITKIRS IRENF I 
TDQQKTKEL * EEQLYAHRVDNLDEMDKF 
LEGYK*PSMTHGKIENLCRPITSKDIES 
VIKNLLTDPP 


3430 


17331 


A 


3453 


358 


353 


R* *KLLMMPC* IETNKT*NTCTVIKEEK 
GQPGTVAHACNPSTLGG*GGRIT 


3431 


17332 


A 


3454 


108 


353 


THIiGGNPVRLYAWLLLHYVAS YRI FS VS 
HCKDQTYLP ILL*NFLMELGTVAHTYNP 
S I LGGQGRR I T* VQE FETSLANMVKC 


3432 


17333 


A 


3455 


342 


50 


G P VT PN I P P VP P LGGQ KE LF P P KKKKF * 
KFWGSPPPLFFKKPKNPRETPPESCPKP 
KKTPIPVKV*GKKTP*TFSKKKKTPPIP 
EL FLL KKKKKL IN 


3433 


17334 


A 


3456 


343 


2 


KAPFFFFFKKFWSQKKRGLFFFFFKNFF 
IPGFPGPTFFFWGFFFFKKGPPVF*LKK 
FFGGPLPPLKDPPGVFPFSPF* FWDFPP 
PPPFFFFFFFPFETVSLCRPGWSAVSRD 
C 
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nucleotide insertion 


3434 


17335 


A 


3457 


2 


259 


LTKPTESGTEDKKGRDSKESEGKTERTE 
SLRSQNGGENSV* SCPSTSSTAALNTAA 
AANPLALNPQVSLSSLS CCP I S I KGS FV 
P 


3435 


17336 


A 


3458 


186 


334 


AGGNKKMEKRNM*NKEI F* WFKKNWPGA 
GAHACNPTTLGGRGRRIMRSG 


3436 


17337 


A 


3459 


207 


1 


YFKP TLKIND KG YLNET I D KFD PRD IPTR 
TSNLTL*EHALFPSTCGIFMRRDYIEGH 
EVNLNQFQRIRTQ 


3437 


17338 


A 


3460 


268 


3 


PLNPGPKQSS P PGLiPKPWPFRGEPLPLG 
PWFFFVKNRGRP*AFFGTNPLAFFLFFL 
* DRYL FCHPGWS VWQSQLTVQ PPPLRL 
KQFS 


3438 


17339 


A 


3461 


350 


2 


AETAHNINNAFGPGSANKCTVRWWFK*L, 
CKGVKRLEDKALGGWPSKVAIMQLRAI I 
KGAPLKPTRKVAKELNIEPSAVIQHLKR 
IGKVKKLHKWVPHELHENKRNRFEVWSS 
LIIP 


3439 


17340 


A 


3462 


312 


1 


HIVTYYSGRKEPFGYIGMV*AMISIGFIi 
GFIV*AHHIFTVGIDVDTRAYFTSVPI I 
IAI PPGVKVFS *IjATIiHGSNMK* SAAVL 
*ALGFIF!LLPVGGLTRIVLA 


3440 


17341 


A 


3463 


181 


3 


FLPLLAAFSGLLLFSGFLGVFLFCFFET 
ELPRLECCGT I S AHCSL * LLGS S DS CAS 
GTH 


3441 


17342 


A 


3464 


176 


263 


VGHDAHN P S I LGG P GGW I T * GQ E FKVDA 
A 


3442 


17343 


A 


3465 


239 


336 


EPGEVAYSCNPSTLGGRSGWII*GQEFE 
TSLT 


3443 


17344 


A 


3466 


88 


326 


QTSLIiNTI**PLH*KNIPLPPPPLPPT:b 
TPPFSPPPPPPPPKKKKRKRK 


3444 


17345 


A 


3467 


4 


345 


EEERRRERKKKRRKKEEKKKEEREREEK 
ERKKKKKKKE KKKRRGRQEKRKGTG * IE 
RKQ 


3445 


17346 


A 


3468 


265 


335 


QMNPDRHLQF*DKVLIjCHPGWGTVARS* 
LTATSRSSCLSPPSGWDCRRVPPCLASF 


3446 


17347 


A 


3469 


115 


342 


INKQINKIKDPTSYNILLSQSLI*FKAL 
TKLNS VKAQRGE KATEEKFEGS IS* FMR 
FKQINHLYNIKVQGEKPSTD 


3447 


17348 


A 


3470 


56 


328 


RTYPTKPQKMRDPEDPTSRPRYIPRHLY 
I TDE YND P SLT I KG I GHQWD * T YE YTD Y 
GGLTFNSYILPPLFLEPSDLRLLDVDNR 
VEPATRA 


3448 


17349 


A 


3471 


3 


329 


RIGARDESSLAVTHKPRKIHRKLTLWVF 
VTSTD S L VM P RPD KNHQ WAFNKNC F P LVDV 
GIDPYLVYHLRPHQAQGIIFLYECVMGM 
RMNGRCGAI LAD * NGLiGKTliQC ISL 


3449 


17350 


A 


3472 


154 


2 


PNLLYPKSYWKKKDIWTPMFIVALVTIV 
KIQKQPRCPLMDKSIR*L*YIH 


3450 


17351 


A 


3473 


132 


340 


ALKTENTTTLCLDF * L IETVR* * MCVFK 
KKKKKKKKKKKKKKKKKKKKDPPKKKKK 
IK 


3451 


17352 


A 


3474 


3 


345 


VIYSSIFAGTLITALSSH*FFT*VGLEI 
NMLAFI PVLTKKINPRSTEADIKYFLTQ 
ATAS I FLL I AI LFWN I LSGL * T I TISJT TJSF 
QYSSLIIIMAIAIKLGIAPFRRRGREFQ 
IY 


3452 


17353 


A 


3475 


1 


254 


FHHVGQAGFTPDLR* STRLGLPKCWDYR 
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REALCLVHILLYLPMIGPFRIQKLiLKAA 
VFVFLQLHQAFIQSLLTLFYFISFPCNS 


3453 


17354 


A 


3476 


351 


186 


SVLSADNTEMGFHHVGQAGLELLSSSDP 
LAP AS * SAG I RGVRHNTWP VLT I TLN 


3454 


17355 


A 


3477 


330 


3 


GEKVFE I PLTRDNVS EILRCFLMAYGVE 
PALCDRLRTQPFQAQPHQQKAAVLAFLV 
HELNGSTLI INEMDKTLESMSRYRKYEW 
NVKERLRRL KTARA* RLGRS Q VKK 


3455 


17356 


A 


3478 


373 


248 


NRLNPGGGGWSEPRSQHSTPTWTEYDS 
VSKK* INKNKNWK 


3456 


17357 


A 


3479 


3 


92 


EGQGCSE P * S RYCS PAWVTE * DC VSKNF 
F 


3457 


17358 


A 


3480 


380 


2 


VEVPLLNTSVFFASGVS IT * AHHSLI EN 
NRNQI IQALLITNLGGLYFTLLQASEYL 
ESPFTISASI YGSTFFVATGFHGLHVI I 
GSTFLTMCFIRQI I FHFTSKHHFGLEAA 
A* CWHFVDWCIFL 


3458 


17359 


A 


3481 


225 


1 


ERAERWEQVMIALFSPNYIRLSFPISP 
TPWRHKFRFHLKGYIWPGTVAHTCNPS 
TLGSQGGQI TRSGD * DHPG 


3459 


17360 


A 


3482 


332 


1 


I KYFLTQATAS 1 1 LL I AI L FNN I LSGQ * 
TITNTTNQYSSLI I IMAIAIN*GIAPFH 
F*VPEVTQGTPLTS 


3460 


17361 


A 


3483 


277 


1 


LSMSSFSISSIPSASINS*AKLFSSAES 
NSLECTFSSSFSPNSSESFSSSFPSCSC 
AS * FSNSQSLI KAS SSTEAS PLMS S AS C 
FLPLPLVG 


3461 


17362 


A 


3484 


404 


209 


ALS VQ PLWRPRQENGVNPGGKASR* PRF 
P P CG PAWATKGE L VP KKKKKK I S FLGHD 
QSLELiLLQR 


3462 


17363 


A 


3485 


31 


347 


FCILERKAFLYW* CWKFFFFFFFFFLE 
KGFFFLPLLVFFWCYYHLKGASLP*VFL 
FFYLHCYSSLFTLFFIFLQSLTPHFFLV 
* TFFYYDPLMFFFLTLFLELV 


3463 


17364 


A 


3486 


327 


161 


QKKVRDPYEQL*ANKFECN*EEMPRFLEK 
HKLPKLTQEEIKNQ 


3464 


17365 


A 


3487 


311 


3 


PRRGR FQL TQNFAL P FNLGNKTK I P FKK 
KKKKT E P L> CP CT C VCAAFVCL C FHLCT * 
P CD CAHDCDN I C VC VN I C VYVC WTHLH 
TGLCS CVDCDHI CVGTCC 


3465 


17366 


A 


3488 


245 


1 


RPRRLIiALLL 1 1 1 T F * LPQ LNGY I E KS T 
P YE CGFDP I S PARVPLS IKF FLVAI TFL» 
L FDLE I ALLLPLP * ALQATNLPL IV 


3466 


17367 


A 


3489 


2 


400 


AAP ILGMLGLRQRGKYKRQRQLLCLWSA 
I LLFL YFLNKLL * LTKKKKKKKKKKKKK 
K KKKKKKKKKKKGG P RGGN P Q I L KGK I G 
KPPGGFFKGGGGPKKIFFFKNRGALFFK 
KKP P Q KKNLGPGP P GL KNFLKE KKKA 


3467 


17368 


A 


3490 


57 


366 


INVFAGKDQLQPLCACPSSPIISVESDR 
PDMYQCGLNHFI YLI I YLRWS FALVAQA 
GV*WHHLGSLHPLPPEPKCFSC 


3468 


17369 


A 


3491 


25 


384 


IRGTF* FFWGGLGGLSKNNPL* KKNFSP 
PKLGVLPPPLFFPGGGFFFFFPGAPPGF 
KKKKPPPKKFFF* KKGAPPFFFFFFFFF 
FF 


3469 


17370 


A 


3492 


170 


41 


AQPDQHGETTFLLKIQKLTRHGGICL*S 
QLIGGLSFITQKITC 


3470 


17371 


A 


3493 


1 


240 


VliS PADKTNVKAAWGKVGAHAGE YGAEA 
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LERMFLSFPTTPS *FLDFFFFLEKIGFH 
HVGQS SLGFLTSRDPSAYNSLLGS 


3471 


17372 


A 


3494 


359 


204 


TQWLMPVILAPWEAEVGR* PEVRSLRPA 
LNS P FFKRL WIMM YIMNI I ILS 


3472 


17373 


A 


3495 


1 


178 


DRLS PGV* GS SEL * LCHCTPGWATEPDH 
VSRKKKKRKKVGLGKNKAKQKHPRGYYR 
AELi 


3473 


17374 


A 


3496 


361 


1 


LGPKLLKKPFWISIGIKGVAGLPGFFPT 
PPQ*G*NFGLKRGGFFFYFFEAKGWFNT 
PGGKQRGPFRSLEPFPFRVKLFFPPNLP 
NNWGHKGAPPKPPFFFFFLRWSFALVAQ 
AGVQWCDL 


3474 


17375 


A 


3497 


276 


79 


AEVYKI KCELGAVAHACNP S TLGGQGGR 
ITRS * VSASFTKKYWQLALMKELLNE 


3475 


17376 


A 


3498 


3 


476 


FGTSMS 1 1 TL I TPRLARGNNNLFS CFPT 
FTFDHLATPLLILTT*LLPLTSMASQRH 
LFREPLSRKKLYLSILNSLQSSLIITFT 
GTELI IFYIFFETTLIPTLAI ITR*GNQ 
PERLNAGTYLLFYTLVGSIiPLLIALIYT 
HNTLG S LNI L LL T VS AQE 


3476 


17377 


A 


3499 


581 


2 


AHACNLNTLGGRGRWIT*GQEFDTSLAN 
K 


3477 


17378 


A 


3500 


380 


135 


RHQSGQIGKTPSLVKIPKLTPPKGGVL* 
SHLTKRARQEKGLNPKSKGFN* PKLRPC 
P PT I VAKKQNP VS KKKKQTRQE P S A 


3478 


17379 


A 


3501 


1 


386 


GSR*GNHPERLNAGAYFLFYTL*GSLPL 
LIRLIYTDNTLCSLNILLLTLTGQELSN 
S * AHNL I * LAYT I AF I VK I P L YRLHL * L 
P KAHAQAP I AGS IVXAAVLLKLCGYGI I 
RLTL I LNPLKKH I D YP 


3479 


17380 


A 


3502 


2 


389 


EKQL * KDKQVYRATHRLLVLGAGESGKS 
TIAKHMRILHVNGFNGEGGEEDPHAARS 
YIDGEKATKVQDIIiNNLKEGIETIGAVM 
SNLVPPVELANPEHQFIVDYILSAMNVP 
DFDFPSEFYEHAKALWE 


3480 


17381 


A 


3503 


2 


379 


PFGYIGMV*AMISIGFLGFIG*AHHIFT 
VG I D VYTRAY FT CAT 1 1 IAI PTGVKVFS 
* LATLHGSNMK* S AAEL* ALGF I FLFTV 
SGLTG I GLTNSSLD I RLHDT YYWAHFH 
YVLSIGAVFAIIGG 


3481 


17382 


A 


3504 


281 


68 


FKMRFGWGHSQTVSENYKPLLKE IRENP 
NK*KNIPCSWIGRISIVKMPTLPKVIYR 
FNAFPIKLPSPFFTL 


3482 


17383 


A 


3505 


139 


263 


ETGSHYVPQAGL * LLAS SDPPATAS QN I 
GITHVSYYVQPL* P I FWEAVAGGSLEAR 
S 


3483 


17384 


A 


3506 


2 


427 


LKTSMTM I S TTLLT * LRPAWH * AQ YQT P 
FFV* SAL I TAGLLFLFLP VLAAG I TVLL 
TDRNLNT T F FD PAGGGD P I L YQHL F * F F 
GHPEGYILILPGFGIISHIGTYYSGKKE 
PFGYIGMV*AMISTGFLGVIG*AHHVFT 
GG 


3484 


17385 


A 


3507 


343 


3 


GWHSRSSGGASSPSSPSHRSRLRVSSGS 
LGRL * WS KS S KTSVS CG WARDTEKPPL 
SKIEASSSDSCESVMARANAVFFTSTTG 
EKPASSASLSLSVWASGSSPPGAANSSS 
S 


3485 


17386 


A 


3508 


3 


363 


GP GD KAARR I ALENLD IT* KWGMRTHKC 
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peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P-Pro?ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine 7 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AKQLLS KQPF * S S RTGDLKLELQIELVR 
ETKRKC*RVL* IDRALTSHLYSLLQTQD 
ALGDAS ADL IQKTS* LQEE FGY YAKTQK 
LLCRNGET 


3486 


17387 


A 


3509 


3 


434 


GPGDHVARG I SGKKYDIGQKWGMGT YKC 
TKELLSQRFGLGSPTVDLEL*LQIELLR 
ETKRKYESVLHLGRALTAHLYSLLQTQH 
ALGDAFADLRQKSPQLQEEFGCYAQTQK 
LLCKNGETLLGAANLFVSSIYTLGTKTM 
EDTL 


3487 


17388 


A 


3510 


1 


401 


GTRKNS FLHDSQTS FCFSDS I PTPSNME 
ETQHKSNLELLRISLLLIEPWLEPVRFL 
KSMFANNLVYDTSDSDDYHLLKDL*EGI 
PTLMGRLEDGNRRTGQILKQTYSKFDTN 
SHNHDALLKNYGLLYCFRKDMN 


3488 


17389 


A 


3511 


117 


2 


FKFT FSEMESHS VA* AGVQWRNLGSLQP 
LPLWFKQFSC 


3489 


17390 


A 


3512 


491 


90 


KKVK I VLAGFFFKPPPPKN * KKKGhK * R 
KGVGPRGYTVNPHPCGYKPFGGPGVQNP 
PGQ KGE TP FFQKKNKKPGGGGGAL I P P P 
KGGGRGVPLSPEKFGLKFPPGPPP*GKK 
KNRFQKKKKKKRQN* PQKSQC 


3490 


17391 


A 


3513 


2 


31 


GKGAPTTSLINVAGTKIIAKWEDNKLP 
GAICSLTCGGAHIGTPMARDERVNLLSF 
TGSTQGGKQVGLMVHERFGRRLVELGGN 
NAIIAFEDAHLNLYVPSTLFAAVGTAGP 
KCTTARRQLIHES IHDEA*NHFPH 


3491 


17392 


A 


3514 


2 


390 


KEFHKTHPDENITLAPDPEGPDGLPPVA 
YNPWMDIRQREDGQTLNIREPYGPIPVD 
F*RKIRQSYFASGA* LDTQDGRLLSALD 
DLQLANS TI IAFTSDHGWALGEHGEWA* 
YNNFDDDTHDPL I FYDT 


3492 


17393 


A 


3515 


3 


410 


RGTGNNWAQGH YTKGAELVD S VLDWRK 
E S * S CD CLQGFQLTHS LGGGTGFGMGTL 
LITKIREEYPNRIMNTFRVMPLPKV*DT 
EGEPYNATLLVHQLVGNTDETYS IDNEA 
LYDICFRTLKLTTPTYGDRHHLVS 


3493 


17394 


A 


3516 


1 


396 


GEDAANT YARGH YT I GMEN I DLGLDR I R 
KLADPCTGLQGFLVFHSFGGGTGSGFTS 
LLMERI SCDYGKKNKLEVS I YPAPQVCT 
AVFE PYNY I LSTHTTLEHSD * AFMVDNE 
AIYDICRINLHIERSTYTNL 


3494 


17395 


A 


3517 


2 


399 


EKIGYNPDTVAFVP ISGWNGDNMLEPNA 
NMPWFKGWKVTRKDGNASGTTLLEALEC 
ILPPTRSTDKPLRLPLQDVYKNGGIGTV 
PAGRVETGVLKPGMEVTFAPVNGTTEV* 
SAERHHE P FRE ALPGDNVGF 


3495 


17396 


A 


3518 


1 


402 


GE DAANN YARGH YT I GTE I IDLLLDRIR 
KLADQRTGLQGFLVFHSFGGGTGSGITS 
LLMERLSIDYGTKSKLEFSIYPAPQVST 
ACSEPYNS ILTTHTTLEHSDCAFMVHNE 
AIYDI*RINLDIERSTYTNLNR 


3496 


17397 


A 


3519 


1 


408 


GS CLGQKLYNSTTKTAT* WSSNHTKKNP 
F S KF P KLQNVWTH PES HRD WTAP TRL YW 
I CGHRP YVKLPNQWAGS C I IGT I KP S FF 
LLP I KTGELLGFP VYASHAKRS IAIGNW 
KDNEWPPERIIQYYEPATWAQDGL 


3497 


17398 


A 


3520 


3 


162 ETVLLCCPGWL*TLGLRQSSCLSLPKCS 
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Amino acid sequence (A=Alanine 
C=Cy stein e, D=Aspartic Acid, 
E=Gfutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop cod on, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














DYRHATVPGQKYRFPNRYYYGSYS 


3498 


17399 


A 


3521 


184 


354 


ETGSSLLP*GSSELPVSHPYFFNTHSLS 
LSHTHTHTHTHTNSS ISNYHCPLYTNMS 
CQRL VAVYRQKG SFKDQCTDI 


3499 


17400 


A 


3522 


2 


313 


Y YLS PPMGDQPGQHGE TLPLQKLAG * GG 
ARLRSQLLRRLRQENCLTPGIQACRELL 
GWYKSNCAFCQQKPQLLCVNLTLYSSLD 
DKARDSVSTKTKREGRGEK 


3500 


17401 


A 


3523 


13 


127 


LGLQARTTSMPSS*GYRRAPPPCPALLY 
F*VETGFHHVGQAGLELLTSTRP 


3501 


17402 


A 


3524 


148 


1 


VCMFLEES CSVAQAGVQWYNHSSLQP * T 
PGLKRSF*LSLPSSRDHRLMT 


3502 


17403 


A 


3525 


1 


98 


VNPGGEGCS EPRSHHCTTAWVTE *DS I S 
KINK 


3503 


17404 


A 


3526 


138 


3 


AAS THDLG * LQP P PPGLTP F S CLRLLS S 
WDYRPVPLRPANFLYF 


3504 


17405 


A 


3527 


10 


206 


PTTSLISFTSYSPCLKCSNVYEQVNRMR 
VLFYKS KQRLGMVAHAYD P S TLGGRGS K 
IT*GQGRRG 


3505 


17406 


A 


3528 


345 


3 


VTEL I QSNKAKL KKKS KNKNEQSLQK I * 
DYAKQPNLRI SGVLEEEEKS KSLENI FE 
KI I EENFPGLTRDLE I Q I QKAQRPGKF I 
AKRLSPRHTVIRLSKVKTKGRILRAERQ 
K 


3506 


17407 


A 


3529 


3 


90 


AVL * ALGF I FL FT VGGLTG I VLANS VDA 
A 


3507 


17408 


A 


3530 


3 


90 


AVL* ALGF I FLFTVGGLTGI VLANSVDA 
A 


3508 


17409 


A 


3531 


354 


3 


AVIKTSLPTKSPGPDGFTAKSYQRYKEEL 
VLFLLKPFQRIEKEGILPNSFYESSVIL 
I PKPHRGTPKES FRP I S LMNLNAK I LNK 
RLAS *NHQHIKKVIHHGQVGFIPRTQGW 
FNKQ 


3509 


17410 


A 


3532 


2 


357 


FTRPRFSFTFFVLIRKYALSCGKISIQH 
SY*KKKSKGKLGGL*GINFCFFERKFHS 
VTQVGGQKHDPG*LKFPPPGFRGFSRLS 
LPS S WNHKRGTPRLANFCTPNKNGGRRG 
RKFQXY 


3510 


17411 


A 


3533 


76 


365 


KKKKKKKKKKKKKKKKKGGAGPEIPKLK 
MGKSEKPPGVFLRGVGEKIKFFFLKKGG 
P FF* KKKPLKNKN" WAREP S FKKI FKKKK 
KKAHFFFGPKILKK 


3511 


17412 


A 


3534 


7 


353 


FFGFGGVFLVFLFFFF*KYPFFFPPPPL 
FFFP FFKWAPRRP PPP FFS S FFKNFFFF 
FPPPPPFFFFFPQKNFPLFFSPLFFFFS 
PKFFFFFFPLTKKKKKKNKKKKTKTPKI 
HPQTQRK 


3512 


17413 


A 


3535 


354 


41 


AGGVP PGI P P P WGGRAGGS PKGRGFG P P 
WP KRGT P P F P F * KKKKKKKKP GGP AI PA 
TREGEGGKS P * PWKPRVQPT * NAPP P S P 
PGEGAKPPFQKKKKSMFHKR 


3513 


17414 


A 


3536 


174 


364 


TFNSLAFTFFFFF*RETGSHFFLEAGVQ 
GNHSSLNP* PPGLRGSS CLTLPKC* DYR 
REPLPPA 


3514 


17415 


A 


3537 


38 


368 


PGRLRLQEAVNHCTLE PGQQS V* KKKKK 
KRGGAFFKAKLTLAGWKGE KKKKTGVFF 
*TLKIRPKEIRNQGGSQQRKKAELGKLQ 
QTYP VLNFKATFYGEQGDYY* TFFKP 
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09/515,1 
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otide 
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26 


correspond 
ing to first 
amino acid 


location 
correspon 
ding to 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 










residue of 

peptide 

sequence 


last amino 
acid 

residue of 

peptide 

sequence 


T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y= possible 
nucleotide insertion 


3515 


17416 


A 


3538 


107 


349 


SVTFPFFFFFLKQTGSHFW*AGGQGGN 
FD*LEPKPPWLRESSLLTLPSSLDYGPP 
PTGPSNFGI FFEKRGFCLVPQASFE 


3516 


17417 


A 


3539 


274 


3 


PRRWSLPLTQMGPLPSRMGPKTKLCPKK 
KKKK I P PRKI LE PD VFTGQF YLMFKE I T 
S I FLKLFQQCEEEGTFSNS F* E ARI ALK 
TK*DKN 


3517 


17418 


A 


3540 


1 


374 


ACALVRNDHIYYWGPTYALGELFFTET 
P 1 1 1 *G*DN* LGPLVIGAHWALTLINY 
ISL*L* PPS VLLLLACAI PE ARAGTG CT 
AYPPLAGDYSHRAAYWLTILSLHLARV 
SSILWDTDSITT 


3518 


17419 


A 


3541 


427 


76 


RGS L S P G V * K * PGGQNQ T F P KAP KLR PG 
APRCFPAGKGSIKPTPGSPPGGRRGTPC 
SKKKKNFRETPPSFFWGGPPFL.KNRGGF 
MFWASKTKNGLNKTLGPNPGPPLGGLRS 
PFWVG 


3519 


17420 


A 


3542 


234 


88 


FFFFFFFFFFFFFFFFFFSLFFFFFFFF 
FFFFFFFFFFFFFW* * ILYFV 


3520 


17421 


A 


3543 


30 


419 


TFSFFPFLFFFGFLVLGGEKQGKWDPKP 
PGAEKPPTFSPPKI*KKGAPPPPRVNFK 
KF * KKGPLP FS PGGKNLKS FWG IWS IGN 
PLWVTPPKNPQKKPPFFKGPKKNFFFKG 
PKPGGKKTQIFFFLWEKK 


3521 


17422 


A 


3544 


159 


411 


VSRDISPVRLSPSSCFH*PFLHISSSVE 
GRFSS FAQFLLS CLNFCTFPLLiFLLLRF 
*DRVSLCHPGWSAVA*SQLTATSTFWVK 


3522 


17423 


A 


3545 


2 


351 


CLAHTSGVES * ASLPCGQNSYIHYLKLL 
CMEELSFPPYLLIYSISYLY*HTIMVIY 
F I LVL * S ILLNL I YLFVTQ I FPALASGS 
SFNWLLCLFNTTIiAVHVCVCVCVCVCVR 
VCAC 


3523 


17424 


A 


3546 


397 


1 


KPPPPCSYGKKKKIFLLVFFRTLPTKPP 
GAGRKI FPRGPGFFLKNPKAL* TTFAPP 
PFIQKKGPFLLQQA*IPFPVFFLFKKKP 
FLCSP*GGIGVSLFFFFFSETESCSVTQ 
AGVQSCDLGSLQPLPPGLKR 


3524 


17425 


A 


3547 


76 


354 


FLFNFMGKIFKNFRGGVFFFFFWGGYNI 
FLVFLKI FFFFFGGFFLPFFFFLKKFFF 
FLGF FFGKI SQKI FFFFVN * I FLLXXGA 
PPLFFFFFL 


3525 


17426 


A 


3548 


197 


1 


INKMKQQRMD*EKrFAKHVSHKGLiISQI 
CK* L IHLNSE KTNS P VRKWAE DLNRHF S 
KRDTKMVNM 


3526 


17427 


A 


3549 


175 


309 


KSTLKVMLISISIFNFYFF*DRVLLCHP 
GWSAVA* SLLTAASTRP 


3527 


17428 


A 


3550 


3 


196 


GFHRVSQDGLDLLTS * SACVS FPKCWDY 
SHE P PR PAHKT S L KKNKMRVR 1 1 IMMT I 
TYPAPWC 


3528 


17429 


A 


3551 


3 


360 


RAVE I FC YDVC VE SGCAD I QLKEjLCPTS 
IKVCFSSLSLDIVTVIPDIYICIYIYII 
FFFFFKQARS*CQCLNDKE*YCSLWPTP 
PGFKKLSCVSLPS IWDLRCAPSHPPNFC 
VFGKNGV 


3529 


17430 


A 


3552 


326 


1 


KDNHRI LKLMLAKSGKVRH YVFDFGKQ I 
S KMSE KNIGVKP P I GKEKTKNPKKKPSR 
TEDPKICTYVL*PDKKKWRRFLNT*GIQ 
LG * KE I KLSLFADDM I VYLENP I V 
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correspond 
ing to first 
amino acid 
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peptide 
sequence 


nucle- 
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location 
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ding to 
fast amino 
acid 

residue of 

peptide 

sequence 


E=Glutamic Acid, F=Pheny1alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


3530 


17431 


A 


3553 


186 


1 


RPRRLP I KLPMTFFTELEKTTFKFI *NQ 
KRVRIAKS IIjSQKNEVGGITPPDFKL* Y 
KDTVTN 


3531 


17432 


A 


3554 


3 


366 


KTSLGNMVKPYNFFV*TFYI**KKEIKK 
NS IGVL * PGTEQS KHE FKKILPS KRNKI 
AS KR I KYL * INLT * EGQDM YTENYKTLL 
KEI I EDLNKYKDI P CS * I IRLDLVMMAV 
LTKLIYRFN 


3532 


17433 


A 


3555 


239 


1 


GRGKMLPVSLLNITRVTFAPCSGSIILV 
L I AL P S 1>R I L Y I TDE VNDP S LT I KS I GH 
QWY*T YE YTD YGGL I FNS Y I L P P 


3533 


17434 


A 


3556 


3 


123 


QAGLEV*TSSDLPTSVSQSAGITGGSHR 
ARPANIINSVP 


3534 


17435 


A 


3557 


237 


2 


KGFSPPTQKFGFLGVFPPQKKKTKKNPR 
PPKGLTPVS*KGMLTEPLEPFFFFFLRQ 
SRSVAQAGVQWGNLGSLQALPP 


3535 


17436 


A 


3558 


87 


379 


KKKKKKKFTLQKPVGFLNP PNGEAKSQ I 
INELLLKIATKKKKNLKLRLKKKGKDLF 
RGNYN PL F KK I KDD P TNGKNF P C S * MG K 
ISFEKMTLLPKAI 


3536 


17437 


A 


3559 


390 


181 


KKKKKNPGGGGPPFFPTFRGG*GGGPPQ 
PGGGGAPKV* ISPPPPPPGGKKSPPQKK 
KKKKKKKGPFLPHL 


3537 


17438 


A 


3560 


2 


101 


VIVTAHAFVIIFFIVIPIIIGGFGN*LV 
PVDAA 


3538 


17439 


A 


3561 


2 


126 


INTLLALLLI I ITF * LPQLNGY I EKSTP 
YE CGFD PIS RRRG 


3539 


17440 


A 


3562 


36 


349 


FAFQK I TLAMEKATEAQ TGGAKKACQGT 
NNIMKFLKKKCIPTTIFWISFFSFLFFF 
FETKPHFFNWPEGKGPNLG*LKPLPL.GL 
KQFCLTLPRKGNYRHVLPPPVNGFFFF 


3540 


17441 


A 


3563 


6 


130 


TLRRILEDHFSLRS *GCGEPCLRHCMPA 
WLTE*DPVSKNIY 


3541 


17442 


A 


3564 


3 


152 


GFHCVRQDLDLLTS * SAHLSLPKCWDYR 
RKPPHPAGKGT YF Y I D YSTI E 


3542 


17443 


A 


3565 


241 


3 


NLVSHVI SMRKVINLVEQNVTSSNRSNH 
TWLS F I KIiYFI 1 1 1 S I VFGEQWFG YVA 
KFFSDDF*DFGVPITQTVYTAPN 


3543 


17444 


A 


3566 


71 


256 


S VFLLTVFL YPLATS FFFLNRVSLCCPG 
WL*TPGLKQFSCLSLPKRWNYRHEPLTP 
GSNFF 


3544 


17445 


A 


3567 


1 


392 


AGAGTSGTDYPPLSGNYSHPGACVDIiTI 
FSLHIAGVASMLGAISVMTTVMNIEPPA 
ITQYQTPLFV* SGRNTAGLLLLCVPVLA 
AGITILLTDRNLNTTFFDPAGGGDPILY 
QHLiKGKRGIIICSHRDRI 


3545 


17446 


A 


3568 


334 


409 


LGTVAHACNPSTLGG*GGHITRSGV 


3546 


17447 


A 


3569 


3 


467 


PKNPPGGFFSPL*EKKYILPPPP* IWPP 
PGFF*KAPPPFFFFFFFFFFFFFPTFQI 
YFITF 


3547 


17448 


A 


3570 


3 


420 


FFDPAGGGDPILYQHLF*FFGHPEEYML 
ILPGFGILSHIATYYSGKKEPFGYIGMV 
*AMIS IGFLGI IV*AHHIFTG* IDVDTR 
AYFTSATI I IAIPTGVEVFS *LATLHGS 
NMK*SAAVLSALGFIFLFTRRGIiTS IV 


3548 


17449 


A 


3571 


1 


418 


NLINHSFIDLPTPSNIFA**NFGSLLGA 
CLi I LQ I TTGL FLAMH Y * PDAS TAFS S I G 
HITRDVHYG* IXRYLHAKGASIFFICLF 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possib!e 
nucleotide insertion 














LH I GRGL Y YGAFL Y * ET * NI G I ILLLAT 
I ATAF IGYVTjPGGQ ISF* GAT VI TNLL 


3549 


17450 


A 


3572 


3 


414 


SSGLAM*FHFHSITLLILGLLTNTLTIY 
Q * WRD VTRE S T YQ GHHT P P VQ KGLRYG I 
ILFITSEVFFFAGFF*AFYHSSIAPTPQ 
LGGHWP PTG I TPLNPLE VPLLNTS VLLA 
S GVS IT* AHHSL I ENNRNQ 1 1 Q ALL 


3550 


17451 


A 


3573 


2 


423 


GTL YLL FGA* AGVLGTAL ILL I RAELGQ 
PGNL VGNDH I YNG I VTAHAF VI I FF I VI 
PHI GG FGN * L VP L I IG AP EMAF PR I NN 
I SL * LL P P SLLLLLAS AI VE AGAGTG * T 
VYPPLAGNYSHPGAYVDLT I FCLHLAGV 


3551 


17452 


A 


3574 


411 


117 


DGGPLLQLLRRARQENCFNPRGGGCS * P 
RLCP C P P AWGAKLNS LSGKKKKKKRHVE 
KYAQHDYI INVLCWCVHI KWLSLYMYS 
I TKS IDKZ CKKAS 


3552 


17453 


A 


3575 


2 


394 


ALANMWEQIRSLHQYAVHRI ISLFSLLS 
KKHDRVLEQATQYLRGSLNTNDVPLPDY 
AQDLT VIEEL I PMMLEI INS *L>TNSJbHH 
IPNLVYALLY*RDLFEQFRTHPSFQDIM 
QNIDLVSVNEDIYYDSFLS 


3553 


17454 


A 


3576 


262 


418 


GWRE WTEL I IFYIF FETTL I PTLAI I TR 
*GNQPERLNAGTYFLFYTLVGSLP 


3554 


17455 


A 


3577 


232 


443 


PSVQTFPCCPLSEEGPWLLCVAMTPSPG 
VPVPTERALYSME CAFHPLFSLTSGACR 
LDYRTPDNR*VQLP 


3555 


17456 


A 


3578 


48 


410 


GGLLLHRAGCWCGHADRRADQPDSSDPH 
S LHAACL P P PGAARAAGAP S PA* TPAAA 
CPAPVPAPCSEHRC*PPGLSQRPCLPGL 
PDARPPGAAAHGGWSLPTRLVLHDRSHP 
EEAPRRHED 


3556 


17457 


A 


3579 


422 


186 


VYSKFL* SQLLRRLRQENRLNPGSRGCR 
E PRS * HC I PAWVTQQDSLS ISSS TSHE I 
CGSHCSPGYLTVHI CDQPVFSL 


3557 


17458 


A 


3580 


442 


3 


GS AI S SRCTQEST YQGPHTP P VKKGLRY 
GIILFITSEVFFLAGFL*AFSHSTLAPT 
SQLGGHWPPTGITPLNPLEVPLLNTSVL 
LASGVS I T * AHHS L I ENNRNQ 1 1 QALL I 
T I LLGL YFTLLQ AS E * LES P FT I SDGMY 
CGRSRG 


3558 


17459 


A 


3581 


428 


1 


IRDIFMASSLGM*FHFTPITLLILGLLT 
T P LT I YQ * WRD VS RE S T YQGHLTP P VQ K 
GLRYGI ILFITSEVFFFAGFF*AFYHSS 
LAPTPQLGGHWPPTGITPLNPLEVPLLN 
TSALLASGVS IT*AHHSLIENNRCIAAA 
LE 


3559 


17460 


A 


3582 


160 


415 


RCLSHCLAFPMLSILFIPSQLLSIiPFHV 
KWIFFFFFLNKKFNCVPQAGGQGKNLI* 
LHPPPRGLKKFSCLRITSS*NYGREPQP 
S 


3560 


17461 


A 


3583 


399 


2 


STHLGL P KC WDYRCE P PRPALLAI VKLL 
CKTIHVRINNVCCLPLLTGAASVIGLSA 
SLLFSVSPGTSLQLEAR*TFPWPHMSST 
CVWRF I FTSLGRAQPQHSDLQTCPRRDV 
DLAEPGLLFNGQMLVPPGRS 


3561 


17462 


A 


3584 


215 


I 


KS PGLLNAFSRGSQI YQASRAMGA* LNQ 
RFWPGTVAHACNPSTLGG*GARIMRSGD 
GDRNSKDGVS PCGRV 
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E=Glutamic Acid, F=Phenylalanine, 
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3562 


17463 


A 


3585 


138 


19 


LNQRFWPGTVAHACNPRPLGG* GAR I MR 
SGDVYRNSKDEV 


3563 


17464 


A 


3586 


262 


3 


GRPEVFYKPPPRGEFFFFFFFLFFFFFF 
FFFFFFVLFFFFFFFFFFFFFFFFFF*Q 
VRLYLLEKYELFFFFLF YPQE I SLCCPG 
WS 


3564 


17465 


A 


3587 


381 


1 


KKFTPPPPRKMGPPPGFF*RAPPFFFFIj 
GGGGGPLFFLGGPPPPGGGFFFSPPPPV 
FF*KSPPLFSFF*KKRGPPFFKRGGSPK 
TP P C F FFKKKNP P KKKKKKKKKKKKKKK 
GRPSS KSDAWVRTRG 


3565 


17466 


A 


3588 


198 


1 


PKS P PRPDARGKE PNF FS KKKKKKS I * N 
LNTNSWLDMVAHI CNPSTSGGRGG* PHL 
RSGVRDQPGQ 


3566 


17467 


A 


3589 


373 


1 


GIFFPPPKRGGFPPPPPKKFFFFPGGFI 
FGGGGGPFWPPPRKGFFLKKKKGVF*SP 
P*KGKKFFFFPGGGVGPPRGFIiKGGPPI 
IFFFFFFFFFFFFFFFFFPATI FLITYK 
DLTCHPGTHFTH 


3567 


17468 


A 


3590 


506 


98 


HTSRGPPPPPKKKIPPTRPFFFCCFWW 
FFPPKKKFFFFCSPRSFIHSPPKKKKKN 
FFFPKGGGPPLSPFLTPPPPLFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFF*LFVSLFS I 
FVLLIFFLFYNAITTI 


3568 


17469 


A 


3591 


407 


1 


LVRGQ ITNKKNLFNFLGEELRE WRWFF 
LPQQRLFFFKL*LQ*KEIYLFI*KMTCS 
RGGAGFKSSLYFFFRSPPQWIiFFFFFF 
IFFYFF*FFIFFFFFFFFFFFFCNDFMI 
YL Y I S PPLP I LE KKKGGRSRS RS 


3569 


17470 


A 


3592 


380 


68 


FRRGVGPQWPPPQKKGFSQKPPGGFKRP 
PLKGKKITFPPPGKFGPPKGPFKRPPPF 
FFLEGKTLFLGPDP* *NLGWGG*RGHGK 
* GPANPRVPP VFF F FEEYS Y 


3570 


17471 


A 


3593 


394 


45 


ALM F S TGQ EGRQE KH P P P HQ KKKAS RGE 
E PRGG VRKNY P P P KKKS F P KKNP P GVT P 
PKKKKKAKLPPRPL*GGPQRRPQKTAPP 
PLNFYPPKEPTKVLSIFFSFFFFFSAND 
VELY 


3571 


17472 


A 


3594 


402 


62 


AMIVLLHSSLGNKARPCLFKKKWAKDLN 
KHFSKEDSQMAISrKYMKRCLISLVIRNMQ 
MK I T VRCH F I L TRMAKI KKMVNS KCR KE 
VQKI P T L KHCWWE C K I V* I LWTNLAAR V 
G 


3572 


17473 


A 


3595 


220 


425 


FISGFLNLQEEREIAFFFKVNIRLGAW 
HTCNPSTLGGRGRRIT*SPGV*DQPGQH 
GEPCLYLNLQKN 


3573 


17474 


A 


3596 


13 


412 


AEIAPLRS IHRARNCLKKKGRKGRKNPG 
GDGGNPQIGPAFFGGGNGKTPGQKPHQP 
QTWGQGRHLKGGGGGPKRKGVPPKKKTP 
T I WGGTPLAFG VE PNW WGHTGNPGGADP 
KPPGREK* SGGAQ I TGPHTGP 


3574 


17475 


A 


3597 


2 


293 


CQLSP*ELGTPRERERERERERERERER 
ERERLPPRDLREKRAQDVFATHPNYEFV 
CVIRHYRGCWYTPLTTAHS IDAHKMCV 
CVKKRPLYKKKKK 


3575 


17476 


A 


3598 


243 


9 


ILLRQEKQCWSKGTRFQLQDEQVLNT*D 
RTWE VMWPGAVTHACNP STLGGRGGRIM 
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RSRDRDS CR I HQGGSKP KS PR 


3576 


17477 


A 


3599 


1 


329 


PSARRDWKNI PKLPPPRGKNPTGFFFLK 
PGGPQRGPGKKGGKNF * S FGKPF*KGGG 
PPQFCPPPPRGGFFFFFPGGPPGEKKGG 
VKGPPPPFFFFWPPPPPNLFFPPGA 


3577 


17478 


A 


3600 


206 


1 


LRALG I MERAAHTE VRLVF I Q I LGS PL * 
YIF*RQDLSPSPRLECSGTIAARCSLDL 
LSASNSPASASQ 


3578 


17479 


A 


3601 


322 


99 


KRTGAFGFFFPPLFNKKKKVFLGPEGPK 
IVKKKKKKEKKEKQNKNRKQTKKK* KTN 
NKQNKKRETD I FGVEDET 


3579 


17480 


A 


3602 


410 


25 


LFFFKKKTGGGGGGPPPFFHPFGGVNGG 
VIRVGELSPPCPPGGTRFFFKKTKNYPG 
G*GGPFFPPPSGGGLGPKKTLSPRGGAS 
L*PNFGPSLPPKGKGGPPFSKKKKKKKK 
QSRPRDS*SLRRGIRK 


3580 


17481 


A 


3603 


27 


349 


RSSIQCGGILLFYFLYFMVFNDIFWL* 
GFSAITSILISMLFSCI*YYFFKMYCLL 
FNFRCHW I FY I LF * MKT I AQ I GN* I F FR 
DAYLGYFFLFLFFFFFFGGKLYL 


3581 


17482 


A 


3604 


88 


435 


RAISTCLQNEQYKWLTTITSVEKSLNDL 
TELKTMV*ELHDKCTSFSSRFIQLEERI 
SVTEDQMNEMNNPPLIHGFIFYGSSYSW 
STMVRKQDKLRVKDAG I KHRNQNKGFTA 
EKSA 


3582 


17483 


A 


3605 


331 


428 


YGH YG Q I EAT F L PA I TT E KE TLKTRKL T 
*SSINLLK*T*NVRPGTVGHACNLSTLG 
GRGGQIMRSGIRDQPNST 


3583 


17484 


A 


3606 


3 


294 


GG*GCSEL*SCHCTPSSLG*NTVRKKER 
KKGKREREKERKRE I I PGFGPCAHHKGA 
VTLGNLFPRVQVSQLVQRRKI I I TVI PD 
RLGEIVLVSQSL. 


3584 


17485 


A 


3607 


263 


426 


RKIiAGTFPFLWKSGPFQTILAAAQRMGG 
NDPKETRVFHAGLS S SLGENK* VWKI 


35S5 


17486 


A 


3608 


213 


439 


LSWACAAEWTVTQPANFPLDYILPFFLL 
IRESSKF*RHMATQLETAASRPGVVVHA 
CNP I TLGGRGGR I TRSGVH 


35B6 


17487 


A 


3609 


2 


441 


MTDLESFPLESELSANSPVQCSGLCKPV 
LT I FQVI SHKGHCHS KY YLLNAN YNVFL 
HLSNSVLPRLLSGQRTETQKHFYTAVLS 
VCLS IYLSRDRVLLCYPGWSAVM*FYSL 
*PQIHGLKQSSCLSLPKC*DYQHGISPP 
WCN 


3587 


17488 


A 


3610 


102 


354 


EPGAWADRCWLGRVLLTHLLWPCCLYFF 
YF 1 1 FILNSGDRVSLCCPRWS CLK* SS C 
LSLPE CWD YRCQPLHPACAVF I ENSKR 


3588 


17489 


A 


3611 


462 


371 


GLNFLTL* SARLGLPKCWDYRHEPPRPA 
WS 


3589 


17490 


A 


3612 


463 


319 


RNSHDGLELLTL* SAHLGLPKCWDYRCE 
PPRPVliC 


3590 


17491 


A 


3613 


49 


476 


PRNS I TLLILGLLTWTLTLYQGWFDVTR 
ESTYQGHHTPP VQKGLRYGI I LF ITSE V 
FFFAGLF*SFYHSRLAPTPQLGGHWPPT 
GITPLNPLEVPLLNTSVLLASGVSIT*A 
HHSLIENNRNQLMQALLITIIiLGLYFTL 
I>Q 


3591 


17492 


A 


3614 


223 


467 


TPS * PLAI I * F I STLAETNRTPFDLAER 
ESELVSGFNIEYAAGPFALFFIAEYTNI 
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nucleotide deletion, V=possible 
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MI INTLTTT I FLGTT YDALS PELYTT YF 
VTKTLLLTCLFL* IRTAYPRFRYDQLIH 
LL * ENFLPLTIiALL I * YVS IPITISSIP 
P 


3592 


17493 


A 


3615 


2 


405 


ARGLQ E AGAVQND P VSRL FDHAMLQGHR 
AHQMAI VTYQEFGETYI PKDQKY* FLHE 
SQTYFCFLNS I PTPS SMEETQHRSSLEL 
XjRISLLLIESWLEPVRFLRSMNANNLVY 
DTSDRDDYHLLKDLEEGIQTLM 


3593 


17494 


A 


3616 


328 


63 


PSKF*KPCLISPPPLGPGPPN*TPGFLK 
KPKFPPRGGAHP* S PLLFRVRPGDPLNP 
RGQGFP*PKLPPGPPTRATKPNFVSQKK 
KKKE 


3594 


17495 


A 


3617 


1 


379 


GTSVGI PTGGQEFS * FAALHGSNVK* SA 

aal*alrimilftvrgltgiglanss:ld 
i vlhd td yve ahfh yvls i gavfai i gg 
fih* fplflgytldqtyakshfti i fig 
vsvtlfpqhflglc 


3595 


17496 


A 


3618 


494 


17 


GSANPRNPRRSGSKTRETSPRlSrVMSLMS 
PMSPTLPMRPTSPGTPMHLTNSMGPMQP 
PNSSRPVNPASSATPVSPASPARPRKPS 
CFPAPQ* * SPPL* S PEPRHGLS CL.SWVK 
KECETWHWSMHI VEDRHS KPYHGPGMV 
AHACNPSTL.GG 


3596 


17497 


A 


3619 


350 


480 


IjGS WD YVCKP PHLAVLLL * RQSFVLLPR 
LVTNS * AQVIHLPWP 


3597 


17498 


A 


3620 


2 


400 


AAAEPHGC YAKVLDE FKPLGEEP *NL I K 
QNCELFEQLGEYKFQNALLGRYTKKVPQ 
VSTPTLVEVS *NLGKVGSKCCKHPEAKR 
MP*AEDYLSWLNQLCVMHEKTPVSDRV 
TKLLTESLVNRRPCFSALEVK 


3598 


17499 


A 


3621 


3 


476 


LGYS VH P P QP LAQVLi S P PL P GAS RAGWL 
LRMRGP PS PHP PRTPAGLQAPHAALVPA 
RASPSTPPCKLREWAPALASPERGSHSA 
VGG*RAPQMPPKWEPRQGRCQEQARALR 
TASMLSPLSFTHPVTLSLLNFPTSLPPR 
NSPSWPWSVR 


3599 


17500 


A 


3622 


40 


390 


MKGALTQPGRAGGLGSQSQMDMASTPKP 
SPDDQGCSIHLWGFPFLVSLLPPSLSPS 
PPSPPRHSVGTHCAP*VLPRPLGALRVP 
HSSVPRPHAVPWGTTYRYSRGAWPLSL 
DFRGL 


3600 


17501 


A 


3623 


52 


475 


AVEFHLLFLQPLETTNLLSVCMDLPILA 
NSPK* NQ 1 1 CDLF VFFHLA* CF * DLFML 
EHVS VLHFMAE * FFF F F F * KGVS L FS P G 
WRAGAQFWLTKTPPHSPARLKQFSCIiTL 
PGSWNYRRAPPSPANFFVFSGKRGFTIL 
P 


3601 


17502 


A 


3624 


333 


466 


S I IGKIRLWKMRPWPGAVAQACDPGTLR 
G*GKRITRSGVRDRPG 


3602 


17503 


A 


3625 


2 


138 


LRRGNRLNPGGGGCGEP* SRHCTPASST 
E * DS VS KNQKTKNS QWH 


3603 


17504 


A 


3626 


3 


337 


LSLRPPALPPCPPRRLPTPGQGHPGSAG 
FPGRPTL*RLPPLPDPPPSLGLSPGDDP 
GIiAREELKS PCP WGPHPLCHLPFP * CWA 
S S QPS INAYEWRKKKKKKKKKKKKKKK 


3604 


17505 


A 


3627 


2 


393 


LGTDLSLIi IRFELGQPGNLLGNJDHI YNV 
I VTAHAF VI I FF I VI P 1 1 IGGFGN* LFP 
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Q=Giutamine, R-Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X-lJn known, *=Stop cod on, /=possib!e 
nucleotide deletion, \=possibIe 
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LIIGAPDMALPRINNISI,*LLPPSLLLL 
IASAIVEAGAGTG*TGYPPLAGNYSHPG 
ALA/DLT I F YLHLARVCS I 


3605 


17506 


A 


3628 


2 


379 


QDATYPI IEELITFHEHALI II CLI CFL 
ALDALFLTLTTKLTNTNISDAQEIETV* 
TML PA 1 1 L IL I AL P CLR I L YI TD EGND P 
SLTIK*IGHQWY*TYEYTDYGGLIFNSY 
I LiP PLFLKPGDLRIj 


3606 


17507 


A 


3629 


1 


400 ■ 


PTRPPTRPPTRPMTNTLNSIHPPLPR*K 
KKKKKKKKKKKKKKKKKKKKKTAAGGGA 
* KKKNRGAKTHRGSKKLNFFFKKRNKKN 
PPVKIEKKTFFLGGKKKNKPPQKNNALK 
KKKNYFEGEGESLLLL 


3607 


17508 


A 


3630 ' 


416 


1 


SQLGYSGVRDPLEEATCLFSDIiKLHAGR 
TTPLFQAVRQGHL I LQRFLQP FA* LCPA 
PRGGVYRGRQASLS CGGLHPVRASWPLC 
LPTQFSAMEGAPPLALLPRSSS ISDYCA 
SNEQGSVGIGPSEPCTGYNLLVCHLL 


3608 


17509 


A 


3631 


275 


83 


RWGSCYWLAGLQLLGSSDSPAIiASQSA 
GITGVSHCTQPKNVI*KISHLTVAGPAA 
DKTPQTPR 


3609 


17510 


A 


3632 


186 


3 


FFFFFFLFFFFFFFFFFFFFFFFLFFFF 
FFFPFFP FFFLVCQVPLLCNL* HLVYLF 
YMVL 


3610 


17511 


A 


3633 


3 


235 


G 1 1 S FLL I R* * YARADANTAAI RAIL YN 
RIGDIGFIIiAIA*FILHSNS*DPQQIAI 
L PHP S D S KKYL YL S HI Q LKKA 


361] 


17512 


A 


3634 


1 


428 


RTLKTTLFNPDGG * DP IL YQHLF* FFGH 
PEVYIIILPRFR*ISHIVTYYSGKNEPF 
GYISMV*AMISLGCLGVIA*AHHIFTV* 
I D VDTRAYLT FAT IMIAI P TAVRALS * L 
APLHRSN I K* SAAKL * ALRL I FLFTVGG 
QT 


3612 


17513 


A 


3635 


2 


129 


PVIYSTIFAGTLITALSSH*FFT*VGLE 
I NMLAF I PVLGFWD 


3613 


17514 


A 


3636 


2 


422 


DR*LFSTNHKDIGTLYLLYGA*AWLGT 
ALNLLIRAELGQPGNLLGNDHIYNVIVT 
AHAFG IILFIVIPIII GGFGN* LDPL I I 
GAPDMAFPRINNISC*LLPPSLLLLLAS 
A I ME AG ARTG * TAYP PLAGNYSHPGAS V 


3614 


17515 


A 


3637 


14 


479 


QNCKIMNFPM YFLPHQSE FPRKS CKQQN 
T I CFRLSNQNTMKKTTENHALLW I RKWQ 
T* KQQCKSNSGYTPSTPSRAKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKPGGGV 
LKKL 


3615 


17516 


A 


3638 


2 


449 


1 1 1 ICFVGLCALVLTLTTKLTNANILDA 
QEVETG*AILPAIMLVIIALPCLRILYI 
TDEVNDPSLTIK* IGHQWY*AYEYTDYG 
GLMFNAYILPPLFLKPGDLRLLDGDNRV 
VLPNEAPIRIVMTSQDVMHS *AVRTLSL 
RTDAIPGRL 


3616 


17517 


A 


3639 


347 


98 


HFFLGGGKGGGFS P FP PQKFFFFLRGFF 
FGGGGGPNFPPPKKGFFSKKPQGVF*TP 
P KKKKKF F F FPRGF WG PPG I FF KG P P P 


3617 


17518 


A 


3640 


2 


426 


DR * LFS TNHKD I VTLYLLFGA* AGGLCT 
ALSLL IRAELGQPGNLLGNDHI YNVIVT 
AHAFVIIFFIGIPIIIGGCGN*LVPLII 
GAPD3yiAFPRINNISF*LLPPSLLLLLAF 
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nucleotide insertion 














AIVEAGAGTG*TVYPPLAGNYSHPGACA 
D 


3618 


17519 


A 


3641 


94 


294 


GRGGGGVAGGRGAMWSKGRQRGRNGPVR 
GREGGRKSHFRKEG*KEGRKEPPPPPKK 
KSRGEKNQREG 


3619 


17520 


A 


3642 


1 


405 


RTRGFDLERFVLFDVAQQMVESGYVCEG 
DHKTMANAI I DRVS L I KRKREQRQL VRE 
EQEKKKQEESSLKQQVEQSSASQTGIKQ 
LPFADTGIPTASTTSASVSTQVEPEEPE 
AD*HEQLQYQQPSLSVLSDGTVD 


3620 


17521 


A 


3643 


34 


405 


HLLGRKQNKAIFLEAEKKKKKKKKKKKG 
G P LKKNP GGAKNURGGE KK I F S P * GGVK 
KTPLG I FEKKP YFGGGKKWANPPKKI KT 
IiKEKKKF*RGKGGKKPENPWGEKDESSS 
PK 


3621 


17522 


A 


3644 


430 


81 


EFAQLSPPQIKGFPSPPPNFLPPSLFGF 
FSPFSP*KFFFSLKGFIFVGGFFPFFSP 
PKKSFFSKNPQLVFISPPFKKKIFLFPP 
PLNFGPPRVFFKRPPPFFFFFFF 


3622 


17523 


A 


3645 


399 


2 


AFFFTKKKKEFFPPFPFKNFFFFSRVFF 
FLGGFFHFFPPQKKIFFLKIPRGFFLTP 
PKKKKIFFFFPR*FLAPPGFFL*APPPF 
FFFFFFFFFFFFFFFFFFSPCSWSCHQV 
FPHASNRIHNPSNSYPLQQY 


3623 


17524 


A 


3646 


49 


326 


KKKKKKKKKKRGGPFKKTPGGAQKNPGG 
EKKF F FLi * GGHKKP PRG I FKKKP FFGGG 
KNWTTPPQKNKAFGGKKKFLGGKGEKPP 
PKPRVKKN 


3624 


17525 


A 


3647 


327 


94 


QKKKKKFFPPPPPKNFFFSPRGFFFGGG 
GGPFSPPQKKGFFSKNPGGVFLPPPKKK 
KIFFFPPGGFGAPPGFF * RGPP 


3625 


17526 


A 


3648 


2 


301 


TSSVASTFIMSLFPTTIFMCLDQEGIIW 
N*H*ATTQTTQLYLSFKLDYFSIIFIPV 
ALFVTWPIIEVSL*YINSDPNINQFFKY 
LLIFLITILILVTGC 


3626 


17527 


A 


3649 


2 


406 


S TNHKD I GTLt YLL FGA * AG I LGTALNL L 
I RAE LGQ PGNL VGNDHI YN VI VTAHAFG 
IIFFI VI PI I IGGFGN*LVPLI IGAPDM 
EFARI3SnSTISF*IiLPPSIaLLLLGCAIVEA 
GAGTG* TVYPPLAGNYSHPGACV 


3627 


17528 


A 


3650 


3 


418 


HAYADAWVHFHS I TLL I LGLLTNTLT I Y 
Q*WRDVSRESTYQGHHTPPVQKGLRYGI 
ILFITSEVFFFAGFF*AFYHSSLAPTPQ 
LGGHWPPTGITPLNTLEVPLLNTSVLIiA 
SGVS I T * AHHSL I ENNRNQ I I QALLI 


3628 


17529 


A 


3651 


526 


2 


WGGGEKGRAGGAGEETPHQESQRQGCAL 
PQEPPLYPPCPAIHPPPLPLGGLLLFPS 
LPPLP*AAWFPHPGQPGRNLNGESPTTR 
EKRLWGPPLPTPLPWPPTQPTPLSRRTG 
CPPCPREVPTPSPPWAEPPPLPGPPSEI 
AHLTTVRRGAALANVNHENISDTDENKli 
NAFVTA 


3629 


17530 


A 


3652 


464 


297 


RNSFHRVSQDDLDLLTI^SPCLGLPKCW 
DYRCEPLLPASYIDVS 


3630 


17531 


A 


3653 


162 


467 


NSKKQQFVLLSEITSYVYWLHKRDLDSH 
T I IMGDFNTPLSTLDRSARQKVNKDIQE 
LN*ALHQADLIDIYRTL,HPKSTEYTFFS 
APHCTYNFLFQSYFSSQF 
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SEQ ID 


M 


SEQ ID 
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Predict- 


Amino acid sequence (A=AIanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=GIutamic Acid, F=PhenyIaIanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, H=Histidine, I=Iso leu cine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


3631 


37532 


A 


3654 


372 


1 


I/MMLS I FS C VHIAF VCFL WRREMS GVS L 
CCPDYSGAAAVHRCNHTALQPQTPGLK* 
S SCLCKRHLQLRPQ VYTTMLGYLFLFLF 
IjRQSCSVAQAGVQWHDHSSLQP* TPGLK 
RSNS PTLV* DLD 


3632 


17533 


A 


3655 


444 


341 


VGQDGLLDLLTS * S ACLGLPKCWDYRRE 
PPRLAY 


3633 


17534 


A 


3656 


466 


315 


VGHTGItE L PTS S EL PA * AS E SAG I TG VS 
HCTQQSGCAFCYSTWXPTKPYTC 


3634 


17535 


A 


3657 


40 


276 


EFDTAHAFVIIFFIVIPIIIGGFGN*LV 
PIiIIGAPDMAFPRINMISF*LLPPSLLL 
IiLAS AI VE KKKNE KKKKTGAPL V 


3635 


17536 


A 


3658 


351 


470 


FTTLPVTSASLPLKTLAWPGMVAHACNP 
STLGG*GGRIT 


3636 


17537 


A 


3659 


40 


182 


GIRKENHLNLGGRGRIEPGSHHCTPAWA 
TK*DSVKIKKTRIQKTPLI 


3637 


17538 


A 


3660 


160 


461 


INGKDTDTPGMVAHAYNTSTLGGGGGGI 
A*GQEFQPSLrRN 


3638 


17539 


A 


3661 


145 


437 


YFSLSLCLLELMNSVILFFFLVLERGFY 
PPPLVEMQGPNFG*LNPPPPGLKQFFCL 
T LP KS WNNRP P P P RGGNFLD F * KKRG FN 
YVGRVDLKLRTSG 


3639 


17540 


A 


3662 


255 


451 


LLSFNPFFFFFFFGKGGPPSLPPGGKGG 
GLNFI*TLGPQGKRNSPPQPPGEPG*RG 
PTPHPGLFF 


3640 


17541 


A 


3663 


185 


66 


PLEQPLRK*r 1 YLFRDGVLLCHPDWSAMM 
QSRLMATSAEF 


3641 


17542 


A 


3664 


42 


193 


FQLLTRLRQENHLNAGGTSCSEPRSHHC 
TPA*VTETQSQKQKVKVLGFSL 


3642 


17543 


A 


3665 


216 


2 


KSKGKCSFLHSRVNRGNSKELLFLRDGV 
WRYCP AWS * TPGLKQS PHFN I P KCWDYR 
REEFHHPGLWYEFE 


3643 


17544 


A 


3666 


43 


267 


GRLWSAMTPGKLKTLCKIDWPALEVGWP 
LEGSLDRSIiVSKVWHKVTYKPRNPDQFP 
YRDT * LrELVlrDP P P P THS G 


3644 


17545 


A 


3667 


212 


2 


SQLLGRAEARGWLEPRSLRKINKTSQVW 
WYTPVLPSSCDYRHTPPRPANFCIFSRD 
GVSPC*PGWSRTPD 


3645 


17546 


A 


3668 


104 


334 


TVLFPPCFLLHIDFCMVLDFFSNFFVLN 
YF*NYRRVQKMYEEFPFFLFFFLIFSFS 
FLFFLSTFNFFLCPLCDLWYH 


3646 


17547 


A 


3669 


369 


260 


DGLDLLTIi*STHLSLPKCWDYRHEPPCP 
ANMPSFKN 


3647 


17548 


A 


3670 


111 


342 


NGVS F FP P KKGG I WANGN FR F P P KPGG * 
RNFLA* PPQKFGMAGPPPPPQPVGIAGP 
PPHARE ILEFFFFFFKKKGV 


3648 


17549 


A 


3671 


377 


3 


FAIIGGFIH* FPVFPGYALAQTYAKIHF 
T 1 1 F I GV3STLAFF P QHFLGLSGMRRRY SD 
YPDAFTT*NIIiSSVGSFISLTAVILIIF 
MI* EAFAS KRKVL I VEAPS INLE * L YGC 
PPPSHTLEDPVY 


3649 


17550 


A 


3672 


39 


348 


AKIAPLNS SLDERMRVS KKKKKGHGLTP 
KPFLKKKLGKNFPLSQLEGVGKKNQKPF 
GFKKGLEIiAPFNGKGNGENPNTWRWG*T 
MSFFFFF*DSHPFIQAGVQWRDLGSLQP 
RPPRFKRF 


3650 


17551 


A 


3673 


327 


3 


SKDPVFFICVFKKNPPTPNIGVFPFALPL 
KRGQFQPLFNPKGFFFFFFKPVQLGRGK 
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nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T^Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ISPQFFFKKRFWG*TKSFFFFF*DSHPF 
IQAGVQWRDLGSLQPRPPRFKRF 


3651 


17552 


A 


3674 


379 


267 


EPRLS P CT PAWATE * DS I S KQNKTKKP K 
IiALNKKKMF 


3652 


17553 


A 


3675 


2 


270 


FCRDRVS LYCP VRS * TPGFKGS S CLDLP 
S SWD YRHAP P GLAK I GGLHVF CL P C CGS 
L.LVCWS SHCSLAHQELLLENKHTDVWMG 
GWMDR 


3653 


17554 


A 


3676 


3 


54 


P VYKNKNKRPDAVAHTCNPS TLGGRGGQ 
ITRSGDRDHPG*HL 


3654 


17555 


A 


3677 


291 


30 


YSLVLLIFFFYCFSISYFISLMNFCHNL 
YYFLLLDFGLVCS YFPS ILWHNEI YF IG 
SMFFIFSVIV*IFFCWEMSCNIYKLHQE 
MF 


3655 


17556 


A 


3678 


1 


298 


RDFSASASQVAGTTGMHRHAQL I FVFRV 
CVCVCVCVCERERERERDGVLLCRPDCC 
S VAHT * VQGRDLGS LQPP P PGFKRFFCL 
KLPSSWGHRCWDAP 


3656 


17557 


A 


3679 


20 


376 


VPP PHLANFLNF I LL F VDTGS Y YVAWAG 
FKLLAPCNLSASNS * S VGITGVSHHALL 
LIiALKCNQLFQFVIFILLHAVCIYDFIM 
VRFSLQCFPQFYDFPFCFILFVSSKAQF 
CFLRLPY 


3657 


17558 


A 


3680 


110 


334 


NTTDYFKLP* ILSGTR*DMYKQPL*HLN 
TVRE YTSKYGWRPGWAHACNPS TL,GGQ 
GER I TRS RD QD Q P CQHGET 


3658 


17559 


A 


3681 


326 


511 


LTMMLQMKI S VI CCLLRN I FWPGAIAHA 
CNP S TLGS * GGR I AR S RDL ENS P S LiLGS 
QPSS 


3659 


17560 


A 


3682 


448 


248 


AKDLNTFFTKE*TPLANKHMKKGSARFT 
YRNVQ I QTT*RCLGT PTRVTGV* MRNS A 
RHR* GCGGLV 


3660 


17561 


A 


3683 


486 


176 


PAPINPPQLRSLVQRSGIRFC*AGQVGF 
KIjL VS S D P P TLAS Q SAG I TGVS HQTWP * 
NYS *NSGMPTHPRQAS S WRLREQWYHVS 
LLPCAYYKNGRQNSLLI 


3661 


17562 


A 


3684 


238 


27 


NKHILCYVGQEMGMGWGSHSHGWKPHFK 
S FQVCP S AVAHACNPS TLGG * GGQ I TLV 
QPGKHGKRRIPALF 


3662 


17563 


A 


3685 


1 


299 


L I TQTCHVGCLKI KCI * TSKKKKKKKKK 
KKKKKARGPLKKKGPLKTPEKPPGGPFK 
APPFWGGPPPRFFFGGGGAPPPVFLKKK 
KKNS PLWGGGGFFFR 


3663 


17564 


A 


3686 


494 


91 


PLGKRFPGGDIALTKLSRPWAWAPPGI P 
PPLGGPGGGVP*VTPFKPTPFRPGEPPL 
PLKNQTFFGVPGPPPYFPPLGGLGPEIP 
FTPGGPGSIKLAPPPSPPPGAPKRNFFP 
KKKKEVMKMLCFHFNLLVQNI F 


3664 


17565 


A 


3687 


17 


249 


YSRFGRHFGHDPGLYFQILRRLRWENHL 
NPGGRG C S KL * S HHCT PAWATE * DP VS * 
KKNVTTIAKRWKQLTCPLADG 


3665 


17566 


A 


3688 


296 


461 


FWLWVS S VLKHMNWPGWTHACNPNTLG 
GQGE * SA* ARV* DQPG*HGETCS AKEK 


3666 


17567 


A 


3689 


149 


422 


ASGRQGPDRGCWGGQAPRHPARGAKPAA 
DMGMSGQGYTKQAAP *GDPSQRAGVRLP 
REPTSRKASGGSGGHSRASGGAVPGQRV 
AQPSPAR 


3667 


17568 


A 


3690 


1 


415 


GDPAGGGDPILYQHLF* FFGHPEVYILI 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=XJnknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














liPGFGIISHIVTYYSGKKEPFGYIGMV* 
AMISIGFLRFIE*AHHIFTVGIDVDTRA 
YFTS AT 1 1 IAI PTGVKVFS * IiATLHGSN 
MK* SAAVL* AL* FI FLF TVSGLTGI V 


3668 


17569 


A 


3691 


1 


393 


GTRKYSFLHDSQTSFCLSNSIPTPSNME 
ETQQKSNLELLRISLLLIESWLEPVRFL 
KSMFANNIj VYD TS D S DD YHL LKDL * EG I 
QTLMGRLEDGTRRTGHILKQTYSKFDTN 
SHNHDALVKNYGLLYCFRK 


3669 


17570 


A 


3692 


402 


2 


FLTQRWGAPLP I P P FWEAP FGP S P* APG 
FKP PLP PRGNP P FS * KTQ I YP ALGARP V 
IPPSLGGGGGESFLPPGGGFP*PRLFPC 
PPPRGAKPNFFFQKKKKKCLFCKAPQAQ 
HRSACLHECLPLGNLLKLCPR 


3670 


17571 


A 


3693 


3 


434 


I FY I F FGTTL I PTLAI I TR* GNQPERLN 
AGTYFLF YTLGGS LTLL I AL I YTHNT LG 
SLNILLLTLTGQELSNS*ANNLI*LAYT 
IAFIVKIPLYGLHL*LPKAHGEAPIAGS 
IVLAAVLLKLGGYGI IRLTLILNPLTKH 
IAYP 


3671 


17572 


A 


3694 


452 


208 


INKSARHGSMNL * S QLLRRLRE VCLS PG 
VQGCIEPWSHHCTPAWETE*DKTLSQKQ 
TNKKTCQHLYDHRPSALEATFRLSH 


3672 


17573 


A 


3695 


464 


146 


LKQTRATKGNLFLP * KPKLTPPGGFRPP 
FQPFWKPRPLGPPKSGVLDPPGPQGKTP 
FVLKKPNLLGGVIPTLFFPLSGGLGRKI 
P S PRGGGGAKKRKP PRQKKKKK 


3673 


17574 


A 


3696 


2 


465 


VTRLRERPALLVSS TSWTEDEDFS ILLA 
ALEKFEQL TLDGHNLPSLVCVI TGKGPL 
REYYSRLIHQKHFQHIQVCTPWLEAEDY 
PLLLGSAEPGVCLHTSSSGLDLPMKWD 
MFGCRLPVCAVNFKCLHELVKHDENGLV 
FEDS * ELiAAHLEKL 


3674 


17575 


A 


3697 


173 


51 


TFISWFGEQWFGCMEKFFSGr>F*DFG 
APITQG*SFLSI 


3675 


17576 


A 


3698 


380 


26 


KQRTNFFLFPFKGTGAFWFFFPPFFNKK 
KKVFLGPGGPKIVKKKKKKEKKENQNKN 
RKQTKKK* KTNNKQNKKRE TD I FGVEDE 
TWRQRWGLGGGRVANSSAEMGKGNSTTR 
PSWNS 


3676 


17577 


A 


3699 


11 


414 


RLWTAI L I QTP * S FTGAVI L 1 1 AHGLT 
S S LLFCLANS QQKKKKKK 


3677 


17578 


A 


3700 


1 


401 


S S SLTLSNNNKS FLNQ FVT FNEMW I LYD 
NW**PAQWLDQEAPKHFPKPNLDQKKVM 
FAVWWSAAGLIHYCFLTPGETITSEKYA 
Q Q FKE MNRKLQP LQ S AM VNRKGP I LHDN 
A* SHITQLKLQKLN* LGYEVL 


3678 


17579 


A 


3701 


395 


140 


TP PKKKKKI FPPPPKI FPPPKKYKKPPP 
PFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFSIYIDFNLCICDNL*LQKTLVFL 
E 


3679 


17580 


A 


3702 


407 


371 


SFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFLWYKDIIIKGKNEQNV*CS 


3680 


17581 


A 


3703 


36 


409 


LLKCE G I WKKEGKKKKKRRE KKGGP FFR 
VLLGGP T FRGGAHKKS LLFRG VF FNLGG 
FFFEGPFFLGGALLGPPPPP I FTPLGKK 
KIFKGKRGPPPLIFLFF*IRGPPPPPSP 
PSTKPPPPPPPTP 
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E=GIutamic Acid, F-Phenylalanine, 
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N=Asparagine, P=Proline, 
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nucleotide deletion, V=possibIe 
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3681 


17582 


A 


3704 


56 


415 


QAPWGKKKKKGKEKKKKKKKKKKKKKRG 
GPL KKNPGGGP TMGGGE KKYFS PWGGEK 
KNPGGNLGKKTPLGGGGItfWAPPPPKKKG 
AGGKKKI *RGGGGKTPLIGGGKKNGGPP 
RTSPR 


3682 


17583 


A 


3705 


108 


1 


NLAK*IQ**IQTIMHHDQVSFIPGIQGW 
LNILKSIN 


3683 


17584 


A 


3706 


425 


52 


PFPH* * *L* KVDP I PLPPQNFLHPSNNI 
YFSPLSPKKFFFSLQPFFFFGEIFPIFP 
PPKKNFFS*NPNLVFISPPLKKKIFFSP 
PPPNFAPPKNLFKSPPPFFFFFFFFFFF 
FSGKNTLYWVDK 


3684 


17585 


A 


3707 


117 


421 


VLLVIQAEYLLPEVFGTRSVSDFGFFRF 
WNICIILSG*ASLI*KSEI*NALKNISS 
FKKKF I F I IiE PRSHS VTQAGVQ WQDHGS 
LQPQTSGLKLFSHLSDP 


3685 


17586 


A 


3708 


308 


3 


RSPPWFFFGFFFFVLFFFFFFVFQLFFS 
KYN I TL Y I KKKTE T FTE TNDNGNAT CQN 
LWDTAKAI PREKF I AI STY I * KVGKHQ I 
NNLMKHLKEIEKQKQTK 


3686 


17587 


A 


3709 


94 


420 


KRKKKKKKKGGGAFKKKPGGAKI *RGKE 
KKNFFLKGGGKKKKVGNFGKKPYFGGGK 
KREKTPKKN*SLKGKKKFLKGKGGKTPP 
KPWA*KKWSS 


3687 


17588 


A 


3710 


413 


184 


LGECLSLRSKKLWSGPAGGGVCIj*FQQV 
RRRKWEDHLSPGIRG*RALSSHRSIPAW 
TAEQDPVSRGLVWGLPGGSW 


3688 


17589 


A 


3711 


3 


104 


GGCSELRS CHCTPAWRQS * TS SQRKKKK 
KKKMY 


3689 


17590 


A 


3712 


2 


393 


GLFSTOHKDIGTLYLLFGA*AGVLGTAL. 
SLLIRAELGQPGSLLGNDHIYNVIVTAH 
AFVIIFFIGIPIIIGGCGN*LVPLIIGA 
PDMAFPRINNISL*LLPPSLLLLLASAI 
VE AGAGTG * TVY P P LAGN 


3690 


17591 


A 


3713 


424 


1 


FFFFFFLRRSLTWQAGVQWRNLGSL*A 
RPRV 


3691 


17592 


A 


3714 


2 


130 


GR VGGR VG * NL KL YS QRL KKKKKKKKKK 
KKKKKKKKKKGGGPF 


3692 


17593 


A 


3715 


3 


197 


FHHVGQDGLNLLTL * S TRLGFP KC *DYR 
CEPPRPAAFFFFFLKKKKTLAPKKFKTL 
DLTTRERS 


3693 


17594 


A 


3716 


3 


497 


RSPPSGFAWPGPAGPWEAGAAAAGQPLG 
ARPHSPCTGGCRSAGTPP* PAGPAGPPP 
HAPAAAE R KPRGAEQ S QRRVQE TGP P P T 
RAESNPSGLRGREVPGSAGCWSGSQESD 
FGAGCPAVT*GTYPYRIAGQVSSRAPHP 
AESRGCTAGLSIGPDGRQ 


3694 


17595 


A 


3717 


40 


379 


AITS PRS PPAVHGS CLARTHPNS PQCLP 
LSGTLLLPSSSLSLSHPATAAGE * IWDA 
EKKKKKKKKKKKKKKKKKKKKKKKKKKK 
S PGGALKKKNPGGPNP P GGGKNI S S PPR 
G 


3695 


17596 


A 


3718 


445 


286 


KKKK*METRSCYVGQSGLELLASSDPAT 
LASQSVGITGVSHQAQPVNKLPRVQ 


3696 


17597 


A 


3719 


15 


383 


G Y FLEAS C LLHP LLH PM YQNCGHVACLA 
AKENGKFMQAACGQLNS PALS LKNGRLS 
AAYGGSDCVYLFPMSKMVLGTQ*ALNKY 
TLNGPGAVAHACNP S TLGG * GGRNLRS G 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VGDQPGQHGET 


3697 


17598 


A 


3720 


14 


472 


RS CARNPGLTHAS AQ YQTP LFV* SVLIT 
AVLLLLSLPALAAGITILLTDRlvrLNTTF 
FDPAGGGDPILYQHLF*FFGHPEAYILI 
LPGCAI I IHIVTYYSGKKEPFGYIGRV* 
AMISIGYLGFIVRAHHIFTVGIDRDTRA 
YFTSATIIIAIPT 


3698 


17599 


A 


3721 


3 


298 


EAAIKYFLTQATASIILLIAILFNNILS 
GQ * T ITNTTNQ YS S L I 1 IMAIAI KLG I A 
P FHF * VPE VTQGT PLTS GLLLLT * QKLA 
PISIIYQISPSLNV 


3699 


17600 


A 


3722 


86 


505 


AGAVTYIVTYYSL*KKDSFWIHSVWC*A 
MRVNWES*GLSV*AHHIFTVGIDVDTRA 
YFTSAT 1 1 IAI PTGVKVFS * LATLHGSN 
MK* SAAVL * ALGF I FL FTAGGLTG I VLA 
NS SLD I VLHDTYYWAHFHYVLS IGAVL 


3700 


17601 


A 


3723 


432 


48 


FKTSFLPFLLPSFCPSFPSFSLFPFSFS 
CFLSCPSSLSFLFFFLLSFHPFFFPTLL 
ISF*VFLF 


3701 


17602 


A 


3724 


2 


406 


HH I FTER I D VDTRAYFTS AT I I IAI PTG 
VKVFS * LATLHGSNMK* SAAAL * ALGI I 
FIiFTVSGLTGIVLANSSLDIGLHDTHYV 
VAHLHYVLS IGAAFAI IGGFIH* FPLFS 
SYTLDQTYAIIHFTMIFIGVNLT 


3702 


17603 


A 


3725 


174 


158 


KFFGGGFPHFFPPQKKGFFPKNPPRGFF 
TPPLLGKNFFFPPPVNLGPPRGFF*GP 


3703 


17604 


A 


3726 


2 


392 


LFS TNHNDMGTL YLLFGA* AGGLGTALS 
LLIRAELGQPGSLLGNDHMYNVIVTGHA 
FGIIFFI VI PI I IGGFGNRLGPLI IGAP 
DMARPRINNISC*VLPPSILLLLASAIA 
E GGAGTG * TAYP P LAGN Y 


3704 


17605 


A 


3727 


48 


474 


KKKKKKKKKTKKKKKKKKKKKKTRGPPP 
KKTPAGPPPPPPREKKNPPPEGGQKKPP 
PAH*KKPPPPRGGQQPPPPPPQKl\rPPPE 
KQKTWGAEGGTP P PKP PRKKNGSQQ I AD 
PPSNAAKNRHPTQEPPPQSQCIPPPPTP 
PP 


3705 


17606 


A 


3728 


387 


53 


PPPKKGVLPSPPNFYTPPK*GVFSPLPP 
* KFFFS PKGL I FWGGGGPNFP PPKKKFF 
S KKPRGG FF F P P * KKKI L ISP PRE KLGP 
PRIFLKGPPPFFFFFFFFFFFF 


3706 


17607 


A 


3729 


1 


397 


PTRPLQPLSPEFKRFSCRTSPDVAW*VS 
LPS PPPSGRS FFVFL * VESFCWI KKRF 
TMLARLVLDS * PSDPPAWASQSAG I TGA 
SLRASLL YI Y* CFS IQLGLLPEFL IQYV 
QLLIVYPRIHHLYLSVPHLH 


3707 


17608 


A 


3730 


407 


0 


TLKHrKKKKKKECKKKRGGLLKKTLGGPK 
*NGGRKKKI FFLKGGKKKQTGE I LKKKL 
ILGGEKKGKNPPKKKKPLGEKKNFKGEK 
GKKNLKMFG VI KLGTS S KKI I F 


3708 


17609 


A 


3731 


3 


200 


GGCSELRSCRCTLAWTE*DSVSLKKKK 
KKKGPFKVLRGKGPSGNJCREGAKKKI QG 
KPCPFWESF 


3709 


17610 


A 


3732 


3 


380 


TNHKDIGTLFLL,FGA*AGVIiGTALSLI J I 
RAELGQPGNLLGNDHIYNGIVTAHAFVI 
I FFIVIPI I IGGFGN*LVPLI IGAPDMA 
FPRINNI S F * LLPPSLKLLLASAI VEAG 
AGTG*TSYPPLAGN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leu cin e, M=Meth ion ine, 
N=Asparagine, P=Proline, 
Q=Giutamlne, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Ty rosin e, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3710 


17611 


A 


3733 


6 


394 


AM I S I GFLG VI G * AHH I F TV * I D VNARA 
YCTSATI 1 1 AI PTGGKVPS *LATLHGSN 
MK* SAAVL*ALRFIFLFTVSGLTGI VLA 
NS S LD I VLHDT Y YVRAHFHYVIiS IGAGF 
AIIGGFIH*FPLFSGYT 


3711 


17612 


A 


3734 


27 


238 


GXPGSS HAS AH I G * 1 1 AVL PH YLNI TIL 
NLTIDIILTTTAFLDLNLNSSTTTLLLS 
RT * NKKKKKRGGP F 


3712 


17613 


A 


3735 


402 


249 


ASAHASAPIiFV*SVIiITAVLItLIiSLPVL 
AAGI T I LLTDRNLNTTFFDP 


3713 


17614 


A 


3736 


100 


430 


VTWPKSDSYEEVKIKLEPRQTDLRALGF 
CLS FTGKKKK* KKKKKKKKKKKKKKKKD 
KRGG 


3714 


17615 


A 


3737 


158 


306 


FLSSVFSLVQGVLFSSLCLDRNLPDMMQ 
LWS E I FNK* L YLQELS I LGVY 


3715 


17616 


A 


3738 


279 


3 


PGHCFYNFNLNFYFQIFFQNVLANSGSL 
IiFL YQTL ILSFNI FYILYL I XQ13 1 VFQ 
SVIL* FIFSNSYSWWYGFLVCLMTFEGK 
KQTCAHV 


3716 


17617 


A 


3739 


268 


388 


HFFFFEMEADS VAKAGV* W YDVGLQQPP 
PPGVKRYS*LSL 


3717 


17618 


A 


3740 


395 


153 


G WLMR F KE R CGLQN I KV * GE AAS ADAE A 
AAS * PEDLAKI ADEGS YT* QWI FYVGET 
AFYWKKMPSRTFIAREKSMPGFKL 


3718 


17619 


A 


3741 


271 


386 


TCPPPLKLWVYELGLL*RKNMRQGAVAH 
ACNPSTLKG*GGRISRSGDR 


3719 


17620 


A 


3742 


406 


190 


QLLKRLRQENPWRWGG*GCSKPRSPPRP 
QAWVTERD F VS KS KT KTNQ P KQNGERKV 
FFWFFFLRKPKVQFEI 


3720 


17621 


A 


3743 


272 


408 


MGLQACANS PL * YVD I AI P CNNKGAHS V 
GLMWWMLAREVLRMRGT 


3721 


17622 


A 


3744 


211 


407 


KGKRN ILFG PVKN I F CDTQ FTS CVNFFR 
EE VS LDCPGWSQT PALK* S S CLGLPTCW 
D*RREQPRL 


3722 


17623 


A 


3745 


405 


189 


CVSQDGLDLLTS * STRLGLPKCWDYRKC 
WTNQLF VNGQKS LNS FVCYS FMVLDTTG 
FKEIVMLWHLVPLF 


3723 


17624 


A 


3746 


120 


362 


LKA*EISWLNGTIQTHQEKPEVQNQVIL 
EENI VFLGPQDRI FECQAVTAELEPEKK 
AIVAGCSDSHLYPSTLGGQDGWIT 


3724 


17625 


A 


3747 


401 


233 


NSGGRGCSEPRSLPCTPAWVTDRDPISK 
NKNKTNHQ * QKKCKR* TGKVI FMTRS I L 


3725 


17626 


A 


3748 


353 


109 


LGLANFCI FSRDGVFPCWPGWSRTPDLR 
*SAHLSLPKCWEYRA*ATMPGLLMCFLI 
KALDFQGCVGRWGGGS WTD I KLCQM 


3726 


17627 


A 


3749 


412 


221 


HGLKECLVYPKWCQNSNISLQGIHFSEV 
AM*QRAHNHGIHWFYHDSFPRKIWINWA 
RRGGSCL 


3727 


17628 


A 


3750 


1 


355 


FHHVSQDGLDLLTS *SPRLGLPKCWDYR 
REPLRPAENTVYVHFS SWSFTNHSGKIN 
*TLDQWKEQLPGWL* ILRGVFFFNPSIW 
NS * QKKAS I YVF P L C WKRD I FL YHF P VS 
GTFRGF 


3728 


17629 


A 


3751 


423 


125 


QVCEKVL I S L I TEMQ I KTTVRYHLTPVK 
MAFRQ KTDDNKCWRG YGE KGT P VHCW WK 
CKWQPL * RTAWRFFK* RKIELP YDPAT 
PLLGI * PKERKSGQV 


3729 


17630 


A 


3752 


3 


171 


FQPIAFLYANSEQSGKEESTSIYSSYEE 
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SEQ II> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E~G\utamic Acid, F^Phenylalanine, 
G=GJycine, H=Histidine, I=Isoleucine, 
K=Lysine, L-Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R==Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














YLGINLTKEMKDL ) *KENYKTLMEKIELD 


3730 


17631 


A 


3753 


2 


247 


FHHVS*DGLNLLTS * SAACLGLPKCWDY 
RHEPRRPALTRFLFVFLYLQRLDHLIRI 
NSFFIMSVNGMFTFWLKLFNLIHA 


3731 


17632 


A 


3754 


3 


197 


GLELLTL * SACLGLPKCWDYRHDPLCPA 
Q YL S FLYAINKLFS S T YLE I HNRL VCC I 
VTLLIYKP 


3732 


17633 


A 


3755 


2 


103 


VS QD GLHLLTL * S T LL SL P KC WD Y S CEP 
PRPAW 


3733 


17634 


A 


3756 


103 


2 


FRPGMVAHACNLSTLGG*GGGITRSGVQ 
DWPGQ 


3734 


17635 


A 


3757 


394 


244* 


AS TH P ANF F CRNGVLL CC P GWS QMP GLK 
QS S * LSLP KC* D YRCE PPHSAS 


3735 


17636 


A 


3758 


341 


2 


AEIAPLHSSLGMKRDSVSKKKELKKLNK 
QKPNNPLKK*AKDMNRHFAKGIQGANEH 
KIKCSTSTNLREMQIKTTVRSLLI * VRM 
AIIKKSTNISCW*ARREKGILIYTWWEC 
K 


3736 


17637 


A 


3759 


2 


322 


PPRPTRPLNNT INQMETI CM YTKCYL I I 
AE*TLFPSAHGTLIKIGHILGHKASLNK 
FKRIKIIQSAFFKHNGIKLENDKRKITG 
KHPNTWKLRKHTSKPMSQEMFF 


3737 


17638 


A 


3760 


444 


422 


VRLPHRPSPPPDPRVRFHHVGQACLKLL 
TSSDPPTSASQSAGITVPSkQ*QP*SVR 
LP 


3738 


17639 


A 


3761 


266 


1 


GGGVPLGPVIPPLRPPQRVGFQALAPQP 
GPGSKIF* IFFGPQFKNRVFFKGRNLVF 
FPPLFSLLPFFFFF*DGVSLCHPGWSLE 
CSGM 


3739 


17640 


A 


3762 


70 


381 


HSITSYI*AFIWGFFLFCFVFFLIKEME 
S H YVAQADHE L PAS WAP P T S AS Q YAG I S 
GGSHPTQHYFIFYWIFYFTVFCGGGFLR 
RS LAP S FFAQAGAQWRDLGS 


3740 


17641 


A 


3763 


229 


390 


DIPYSFCFFFFFWKGSFVFVPQAGGKGR 
KLG* LKPP PPGLRGFSCLTLPGSWEN 


3741 


17642 


A 


3764 


119 


1 


QTSCRPGAVAHTYNPSTLGG*GVDHLRS 
GVRNQPDQHGE 


3742 


17643 


A 


3765 


400 


200 


ITAYCSLDLLQPHFLFFSRDKVSLCCPV 
WSRTPELK*SPCLSLPECWD*S*ATASG 
LKHFIKSLPL 


3743 


17644 


A 


3766 


376 


44 


TQP I FLGPGFQAP PGPMG * PNFLF* E KK 
LVFFPKKPKTAPAGGGGPKFPLPRKPKQ 
KKGPHPGSQNFQKPRPKFPPRPPTWGKK 
REFFSQKKKKKKRERDVKITDLNTST 


3744 


17645 


A 


3767 


403 


3 


DHFFFKGPPSQKGEFPLFLKTFPSPLGG 
FG* ISLFPPPINFPPPPKKRIFPGPLSF 
F F FKKKKT Y PAGKGLL F F P F P KKP GGG F 
KRWPGPPKKFFFPPPGFWACSPPFFFFF 
FFGAKVSLVAQAGARSRSRTS 


3745 


17646 


A 


3768 


407 


177 


FHHVS QDGLDLIjTS * SARLS LPKC * D YR 
HDPPRLARCIKNFCRGWRCNILRKSPLS 
LPPSAGELRAPFLDFLEDAQ 


3746 


17647 


A 


3769 


292 


391 


RPGWAHTCNP S TLGG* GGQ I TRSG VQD 
QTGQY 


3747 


17648 


A 


3770 


395 


198 


LDMGFHHVGQAGPELLTS SDLPASAS QS 
AAI TGMSHRAQP Y * VI SMCQQCYKHF I R 
IPFNLHNNV 


3748 


17649 


A 


3771 


1 


241 


RTRGRTRGRTRGLLSAHTGRYQKHPRIR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, MHMethionine, 
N=Asparagine, P=ProIine, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan 7 Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 






m 








TPQIFSTGTTKAEAGELLEPGGGACSEL 
RS CHCS PARATRVKLHVKKFKKI K* KHP 
RIRTPQIFSTGTTKAEAGELLEPGGGAC 
SELRS CHCS PARATRVKLHVKKFKKI K 


3749 


17650 


A 


3772 


112 


406 


FFFFFFFFFFFFFFQGEGVPPI *KPPPP 
K*FMFPKKKLPPPPPPALKLFFFSPPPP 
FFFFF 


3750 


17651 


A 


3773 


24 


197 


IFSVEMGFCHVGQCDLDLLGSSNPPASV 
S * SAGVIGVSHHAWPLVKLF VKRNKLS Y 
V 


3751 


17652 


A 


3774 


85 


201 


VNFFHLI *FVNKRI KLEKKKKKKKKKKK 
KKKKINWKKKKK 


3752 


17653 


A 


3775 


1 


382 


WENHLS PGSRGS SEP* SHY CTPA* VTE * 
DS VS KKKKPNI.QE AEKRLTLLL I KKAI F 
KNRGS PCFS I KWAGFKKI KQGWFFVATR 
MGDRSRGTPNLGENFLKSFLSFSIKIQR 
FYYLGLKNSPSRNIY 


3753 


17654 


A 


3776 


1 


181 


IKMATVTKATYRFCVIPSKLPMLLFTKL 
ETKILKLI WNLERA* IPKAILS * KNKMG 
LSNS 


3754 


17655 


A 


3777 


70 


385 


LNKMQTKTFDSDTHTSICFTLHFINTKL 
QFIFLLAIYLLFVNFPFIFINFFEEPGF 
CFDDLS S CFLFY* CIP* FYYFFSAFFRL 
HLFFFYRFLSLLTLDLFSFLI 


3755 


17656 


A 


3778 


2 


406 


KDDRNTEHQVHPNASDQEDEAFDPYENF 
FEDSDSPTKSSSTEPSPHIHPVDIQMTI 
FHCADNFARQ Y I LAKLAKKKKKKKKKKK 
KKKKKD S S S P GGG P KKGAP P P P PRGE KN 
FFFFGPPLIFSGGGF*NGGGGGP 


3756 


17657 


A 


3779 


403 


176 


FLFPI LFFFLRQS FTLVTQTGGQQRDLG 
SMQPPPPRFK* FNEGKGKRKLWGGA 


3757 


17658 


A 


3780 


3 


248 


GLNQTQLRKILAYS SITHIG * I IAVLP Y 
NPNI TILNLTIYI iltttackkkkkkkk 
KKKKKKKKKKKKKKKKKKKKGGGGA 


3758 


17659 


A 


3781 


3 


330 


VDR LRTG VQDHPG KHS ES P S L P KKKKKR 
* KI LP KRGGVYLNPHFFERLRGENNLTP 
GGEGQRETESPPCLPAGGKNKIFV*KIQ 
KKKRGGPFKGKKFFPRGGGKKXFF 


3759 


17660 


A 


3782 


399 


187 


FPPPAKGDFLPSHLFGLPPGFSPPPVFK 
PRPRNLILGPP*KKFYLPPPRP*TWFL* 
NGPPPFFFFFFFLDF 


3760 


17661 


A 


3783 


392 


11 


KKLKVNLPDHPAVPPLGIFPKEMKAGI * 
RAICTLMFIAALLTIEK*WKQPNWLLVD 
K 


3761 


17662 


A 


3784 


324 


80 


PGPFFFFPPPAKRGFFPPPFFLVPPGFF 
PPPLF * TPPPDFFFLAPLKKFFFS PPPA 
LNFFFFKGPPPFFFFFFFFFFFFFF 


3762 


17663 


A 


3785 


354 


24 


PPNKIFFSPPPQKKYFPYKTPPPTFFFL 
SSFFFFFFFFFFFFFFFFFFFFLC*MVH 
F I VKYRKCKRGTR I I LHTPTLRQLVLCC 
LiS D VLLGC C I F I Y VTMCDL CAP KLG 


3763 


17664 


A 


3786 


3 


369 


SCRGLVOTYGELLLHPCRGNQISRVMQK 
KKKKKFGAS PPREGAPPGEPP *RGP* TQ 
KE P P P L KK * AP P P G PGGGR P P PMGLGGG 
PAPPPFFWPKAFFKKGPFSGQEFGFPLK 
GLFFWKFPF 


3764 


17665 


A 


3787 


1 


125 


FRHR F S RDGLDLL IF* S ARLGL P KC WD Y 
RREPPRPAHSIII 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3765 


17666 


A 


3788 


-427 


251 


LTPKGRGCSEPR* CHCTPAWVTEGDSVS 
KKKQAKNRCS LRMKYERNE FTSSPC 


3766 


17667 


A 


3789 


250 


2 


GFPPVEKIKKNPFPIF*KKPGFFFFPNP 
FF*ISGGKGKIFFLGPPPFSFFSPFFF* 
DRVLL CHS GRSAVAQSQL TATS TFRR 


3767 


17668 


A 


3790 


412 


3 


GSTPITTPFWGVGGALPPGGGVPPPPAP 
PGGTPFFFKKKKKKKFQNFPRPRCSRYN 
GGLGRKNFFTLEAKGSINPNFSPAPPPG 
GQNKT P F P KKKKKSR * MTLQS FPRPGTA 
PGAAM*PVSPPEPCGPPAPALSTK 


3768 


17669 


A 


3791 


10 


187 


GCSEL*LIHCTPAWVHSETLSQKKKGRG 
KGGFKKNGIPCFGKNKGGKRFGKSFGHF 
STG 


3769 


17670 


A 


3792 


1 


331 


FRVPLGRLSRGEHQQQ*QQQPPPPPPPP 
GPLRPLAGPSRKGS FKIRLSRLFRTKS C 
NGGS GGGDGTGKRP SGELAAS AAS LTDM 
GG S AGRE LDAGR * DRP GAGRQ T S F S W 


3770 


17671 


A 


3793 


1 


431 


FRKGSARCRAPKPAGADGVHWGHHRAS P 
QTAQAPPVTCSARGSWMDFSHASGPQDR 
DLHS P S PVCPRGWAPGTHS PTGASGA* G 
RTPVGGGK*VAGVQQRIHPGLAPG*LVL 
PGIAVHITKKPGSKGTITPGTVAHACNP 
NTLG 


3771 


17672 


A 


3794 


342 


48 


KRENMPGVAYNPSTLGGCNPSIVGGFCN 
RVCNPS ALGGCNPS FLGGFCNHVCNPS P 
LGG*GGRISRSAAQDHPRQHGEKMSPQK 
I QK I S QAC FARVG 


3772 


17673 


A 


3795 


2 


189 


LNPGGRGCSEPRS CHCTQAWATE * DS VS 
KKKKKKGSRFQNTI PSALPGFPVSGELI 
LGGPNL 


3773 


17674 


A 


3796 


1 


401 


LNPGCKGCSEPRSCHCTPVWVSE*DPIS 
QKKKKKKKKK 


3774 


17675 


A 


3797 


279 


3 


ILAKVYPMVNSFLEKIHYFLLVELCCFL 
QHVL Y YLT F KT TE TG S RS VAQAG VQWHD 
RSSL * PLPHGLKRSFFLSLPRS WDYMHT 
PPCLATE 


3775 


17676 


A 


3798 


229 


394 


VFVKS LLCQPGMVAHACNPS TLGG * GGR 
I TMSGD * DHPGKHGE VS T YNTKKKKGG 


3776 


17677 


A 


3799 


380 


206 


AGHM*S*LLRRLRQEDPLSPGVQGCSEP 
* S * PCPPAQVTVRI SLKKKRLHERDKS S 
AS 


3777 


17678 


A 


3800 


450 


3 


I YLLRGRHS LH YS R P S F Y FSN YRGACS Y 
MRPPQQATLLGRLNQGITREGPAIAHSP 
AVRPRLSLAHAHVRRLCLPRPGAAPEFP 
AFFLPCLLVAQAGVRWRDLGSLQPPPPR 
FKRFSCLSLPSSWDDRRPPPLSS*FAFL 
VEMGFRRV 


3778 


17679 


A 


3801 


301 


10 


KKQPPKKKKRCFPPLFPPPKGFKGSPPF 
FFLTPPPLKNLPWTLKALPQ I FLRTFFF 
FFFERGS PS VTQAGVLWHDLGSLQP * TP 
GFKHMP S CP VLLC 


3779 


17680 


A 


3802 


420 


51 


SASSPPASFFFFPPPKKKIFSHPFFFFF 
PPFFFPPPFF*PPPPFFFFFPPKKKKIF 
PPPPQKIFFFLKPPPPFFFFFFFFFFFF 
FFFCGFVNCQFSVIiI 


3780 


17681 


A 


3803 


253 


73 


QTTPSKRYSYVLNCCNN* ITTCKRIKLD 
P FLTLDLKIN* KS FADLNVRAKT I DTI K 
KVKG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
TJSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanme, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S~Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Un known, *=Stop cod on, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3781 


17682 


A 


3804 


385 


55 


AGTRPVFAALAAAAPAPRLLPLAPHGSA 
ATGFP PP P CPHGEGLRGAP AAS S S P P LA 
HRSQTPNWVDCGESPIPDPQFPLL*NGP 
PALCLPQGLETVNVESLALCFTSHGN 


3782 


17683 


A 


3805 


191 


2 


QKHLKALNFLPRCVCVCVCVCVCVCVNV 
TF*ACKI CTCSDFPGSPRKKYLEFGTKI 
INIVDFN 


3783 


17684 


A 


3806 


1 


367 


KLRQVI YLEDLPT I I DEGS YTKQWI FNV 
N*TAFYWKKMPLKTSVTREEKSMPGFKA 
S KDRLT VFLGANWSDF KLKPMPI YNS E 
NPKAIQNDAKSTLPMLYKRNNKACIAAH 
LFTAWFTEYF 


3784 


17685 


A 


3807 


3 


145 


LGPGGGGCSEPRSCHCTPAWATRGKLCL 
QE KKKKR * RLD P E AE VAVS RGRAI AL Q P 
GQQEGNSVSRKKKKKGRDKVSL.CCPAGF 


3785 


17686 


A 


3808 


362 


86 


NPPFICSPPFFPPLWGPKQWPPWCEG* 
TPPGPPCEIPLPPKKKSKSPPGVWGPPF 
FPPPWGGGAPKFFLPPKWGFPQTQICPL 
PFGLGEKN 


3786 


17687 


A 


3809 


245 


377 


FGYMPSFFETQSRSVTQAGVQWHDLGSIt 
QPLPPGFK*FSCLGLiP 


3787 


17688 


A 


3810 


3 


268 


KRNFLNLIKRRYEKPIANFVFNSKIWT 
FHLRSSTS*GYPLLPLLLNILKVLPNEI 
RQQKE I KGRKNKGKKTISVYS IML.SANT 
EKPK 


3788 


17689 


A 


3811 


2 


388 


SAENCLNPGGRGCGDLRWCHCTPAWVTE 
RDCVSKTNKKETEKYMRN*TGNMLTX ID 
AGRRYMRCHQTIFTFFFFWRGSLIFLAP 
PPSQGEGIFLPHPPEKWKKKVRITTPTY 
FCFLR.TDGFLPCGPGGF 


3789 


17690 


A 


3812 


292 


136 


PPTVIARKGKPRPKLSSEPWQFSAEQLG 
LPSFLCFFVCVRCCCLLLFLLLFCRDEV 
SICCPGWS* TPGLKQS SHLSF PNCWDCR 
CEPLCRAIF*FFCLCTLLLFAFVSAAVIi 


3790 


17691 


A 


3813 


359 


52 


KKGRFFFFEKNPFGDLVPKKKKKKTPPP 
PNPKKKKRGAPPFFFFFFFFRH*VPLCC 
LGWFRTPGLKGS S C VGLPKP WD YRHKPL 
PLTKDD Y YVYGLP FVS LQ 


3791 


17692 


A 


3814 


382 


1 


GGPGMEAPLGYHI KTPFPLKPQKNNPGG 
GAKPWPPPPGG*AGKMGLPPRQRFPLT 
QKGPP P P PLGGKRE P P PQKKKKKRVTAM 
GDP*NSECR*GCHLRVQKEMRLPGLRNS 
PPHYI PNRNTDMCS C 


3792 


17693 


A 


3815 


472 


340 


TGFHRVSQDGLDLLNS*SAGLGLPKCWD 
YRRGLFX I I FNMGEWS 


3793 


17694 


A 


3816 


247 


378 


PPPPGGGXGXNLGLLKPLPPG*RErPPP 
PPRGGGNPGGAPPPPG 


3794 


17695 


A 


3817 


25 


465 


TPPLGG*GKRFPRGPGFSPPPPPKGNPP 
FF * KFKKKPRPGGGAP * FPP PGGGRPKN 
PLPPGPKGSH*PKFPPPPPPGETKRKPP 
P P KKKKKKRKKE KKKE I * AYKAY ICFDC 
PLTGPGDLKSDKTLAFPSVIGLPRPCTC 
LSPR 


3795 


17696 


A 


3818 


475 


147 


PPKG*RHPPGPPRFRRGPGNPPPHRFFF 
VLRTPGYSFPF*KKRGPPPTFFFTLPGP 
PGFLKGPPSPGGGGQKIPFPRGGGGPLK 
*IPPFPPPPGKKKDFFFIiTKKKKKKP 


3796 


17697 


A 


3819 


461 


57 


TNQKKAAPPPLLKKKFFFFFKKHTGGAP 
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PGEKKKRGGGEKKKRGVYRKGPRAPSRG 
CAPPLYYRGGLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFHVKNILFVF*DYIVIIADL 
IQSLRHKDFQELLRR 


3797 


17698 


A 


3820 


1 


418 


HAYHI* KPS P * P LAGALS TLRMTSALAM 
* FHFHS I TLL I LGLLTTTLT I YQ * WCDG 
TPESTYQGHHTPPVQKGLRYGIILFITS 
QAFFFAGFF*AFYHSSLAPTPQLGGHWP 
STGITPLNPLEVPLLNTSVLLASGVS I 


3798 


17699 


A 


3821 


385 


1 


LQFPPFSPLLFGVQRPFFSSFPSCPWWG 
PVPSKLLPLLQILGRGFYPQKSLLVSLG 
FFFFFFLI I KKHRNEVLLCCPGWSRTPE 
LK* SSHLGTPKCSDYRCEPPHPPDPWDI 
CS YVPFLQS THFGLN Y 


3799 


17700 


A 


3822 


72 


424 


KKKKKKKKKFFPRKKGAPPAP PLFGKAG 
GEKL FP PRGL I FKKPKKAP GEKKKNL FK 
KKKKRGGP*KKKKLKAGGGEKKIFFKGA 
QKKNPGAGFKKGGRGKNRGAPQKKGGEG 
SSS 


3800 


17701 


A 


3823 


154 


2 


GHHTQHTEPQRLNNSYRYAELKHLSNTK 
NNP I * KWAKDMNRHFS KEDIQM 


3801 


17702 


A 


3824 


422 


212 


MGSSNPSPTASWVDEPTGICQHT*FYVE 
MGFCHVSKAGLGLLGLRDPPPSASQSAG 
VAGVSHCTRLWI IL 


3802 


17703 


A 


3825 


2 


417 


KRVNTRGFPPPPFFKTPPPKIFMGPQKK 
KNFSPPPPIKIFFF*KAPPSP 


3803 


17704 


A 


3826 


423 


65 


ACGRGKYFKSGGRVKPFCAVDFPPSGKK 
ANQPTPKNPRKKQKPSGRGMSPMGKPPG 
CPCPMPKASPGPVSFNKGGVAPATKDFC 
PIGKTA* IPSKKKKKKSDPG* VKKGKPN 
LYSIYNK 


3804 


17705 


A 


3827 


285 


136 


FFFLFFXFLFFFFFFFFFFFFFFFFFFF 
FFFFFFFFSGA*KIFIIATIHY 


3805 


17706 


A 


3828 


425 


316 


QDGLDLLTS*SARLSLPKSWDYRHEPPH 
PAQTFFKQ 


3806 


17707 


A 


3829 


3 


169 


LSNS * ANNL I * IiAYTIAF I VKI P L YGLH 
L*LPKAHVEAPIAGS IVLAAVLLKLGG 


3807 


17708 


A 


3830 


6 


452 


IGFCQGFAKDFCSKKFQRNVRVSPRGGE 
GRKFLLFKGGGFQKFPPLPPGWGAKTPF 
LKPPPKKKKKKERKEKKEKEKKRNPGPP 
VSPSSWSPVP*APSEQRSGPQLPPSRGD 
* WRL PS P PRPFRDCVRRS ATRGNS PPC 
RGSQGP 


3808 


17709 


A 


3831 


45 


387 


NFGVKVFS *LATLHGSNMK* SAAVL*AL 
RYIFVFTVGKKKKNKKKKKKKKKKRGGT 
P PGGE KRKFFS P KKKKKKGFLKNP QP KK 
NFFWGKPQKKKGGKTPFWGGGKKAPKKT 
P 


3809 


17710 


A 


3832 


463 


288 


DGLDLLTS *SVRLGLPKCWDYRHEPPRP 
APCILMATFFQFEVLFSEFLETIVKSVL 
LH 


3810 


17711 


A 


3833 


294 


90 


FFFLFFFFFFLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFF*AGIYFI 
TCAPPPAYPRS 


3811 


17712 


A 


3834 


427 


37 


SYHFGRL>RRADHEVRRL*NRKGRPGAVT 
HACNP I TLGG * GGR I MRSED * DH 


3812 


17713 


A 


3835 


499 


53 


AKXMYFQSRENVAKRSYRRNQKASMVHP 
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NQEPAVI AGQGT I ALE VLNQ VK* LS S YH 
CRLL VGM YHG CLLVS S VT P FCN I F S GLK 
VH 


3813 


17714 


A 


3836 


304 


3 


RLKI TKTNNNQTE INS * YI KDLNARAE T 
I RLLE EN I GG KPHD I RFAM I F W I * HL KK 
TKNIYKLDYTKILDFCISKDTINKVKR* 
PLEWEKIFANHISDKG 


3814 


17715 


A 


3837 


475 


135 


MEGLFCPAPMVHP F I PPPLGG * GGGS PG 
PGIQTPPGYPGEPPFFFTLKKLPRRGAV 
FFFPPPLGGLGPKNVFTPGAGASLEPGW 
APSPPPGGPP*NPFPKKKKKKRLKIGWH 
F 


3815 


17716 


A 


3838 


415 


245 


ETGFHHVGQDGLDLLjTS * S ACLS LP KCW 
DHRRE P PHPD P Q VS F IRAL F P FMRAP L S 


3816 


17717 


A 


3839 


460 


413 


WDYIKLTGFCTAKETRVKRQPTEWDKTS 
AKYI SDKGLKTRLHKEQ * KMTNLI *NWV 
K*LGRYSH 


3817 


17718 


A 


3840 


2 


470 


GGDAPHFPDGVAARRRGS S LLRWGGCQA 
QGLLTSQTGWPGRDAPHLPDGVTAGPRR 
S PHLRR WAAGQ TR S S LPR CD GGREEALL 
TS*VGWRPGRDAPHFPDWAARQRGSSHP 
RRWAARQRRS S L P RRGSGRAE AAI SNS A 
SLQRVSKVSVSCRPGG 


3818 


17719 


A 


3841 


54 


332 


QFGFHHVDQAAPELLTSDHLPSSASQSA 
GITGVSHRARPSXiAHSFTHSLAIi*PTPA 
HSLIHWLTHSLTHSLSFPCSFSSPPSHT 
HSIARPLGV 


3819 


17720 


A 


3842 


208 


468 


ENPTPPPLWCFALGVIi*EDGFSIiYSLSA 
HVCE I VFL YCP VLE S ARPLMFLWNLAHV 
CYYEWHLFSVSLEFHTVS 


3820 


17721 


A 


3843 


406 


2 


PSSSSSSSSSPPAAFFLGCPGVFPSPPF 
LKPAPGFFFWGPKKKNFFSPPRGVKFFF 
FKGAPPLFFFFFFFFFF*DGVLLCHPGW 
SAAHAS 


3821 


17722 


A 


3844 


370 


253 


VGLLEPRGQGCSELRSHHSTPAWATE*D 
PILKQNETEQK 


3822 


17723 


A 


3845 


369 


73 


L.RQDNGMN PGGRAC S E PRS HHCT PAWAT 
E*DSISKKRKSQNSVFTESYFASEHCYS 
L TGGG VA I LME GP TG P DGAI FM CHLRNL 
CRCSPILPLxALQCDH 


3823 


17724 


A 


3846 


198 


388 


FRKS INVRYHVNRINLKNHT 1 1 PIDAQK 
P * EKLQQHLTMKYS I IMNSRKLSQPALR 
NLF*KPS 


3824 


17725 


A 


3847 


391 


20 


GFPPPRVF*HGPANFFLGPPKKKFILPP 
PR**TWFLLKGPPLFFFFFFF*DGVSLC 
RPGWS AVS KN* QTNKKS IILPI RGKLNL 
FVHSML I KGFTCDLL I FKNRKVKE S TKT 
IDHIFSRQLMA 


3825 


17726 


A 


3848 


227 


2 


KRRNPFHKAI AAIDS SDGPR* S KLKNFW 
RGFS I ADAI KI FRDS WE VKI S I LTGFWK 
KWILTLMDDFEGFNRTRGR 


3826 


17727 


A 


3849 


373 


2 


SSPQKYWGGPGFFPPPGFKTPPPPFFFW 
APTKKKIFFGAGRENFFFLRGPPPPIIF 
FFFFFFLFFFFFFFFFFFFFFFK*YNSF 
I *DIKTHVFLLHS VGFRVLVYKTRLYFT 
LYHSFLKNELG 


3827 


17728 


A 


3850 


394 


2 


EACSVTQAGVQWRNLGSLQAPRRQSFCL 
RGSKLS PQH* EDRT * HI GGTQQTSMNE * 
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MNLYLQWLCPLLYQLCSTSYT*EVIQLN 
IAFCVCLVGFFCFFVFLFFETASQSFA* 
AGVQWCHHHSLQPQLPKL 


3828 


17729 


A 


3851 


3 


164 


NHLNLKGRGCSDLRLCHHTLAWATEGDS 
I S KT KT VQ * RKLQNN I PHE HT C KNL 


3829 


17730 


A 


3852 


35 


390 


NFVSNSFFFFFFGNGAPRGPRGGGQGPN 
LG*GEPPPPG*KQFSFLTQPRGGNNGGG 
PPPRVNFGFLRKNGVPPGGPGGPQPPDLi 
GTGPPPPPEGSPPPGPQFSKTKRTAPGQ 
PKGGPF 


3830 


17731 


A 


3853 


6 


437 


PNK*KTPGLPPPPFFYFFPPPPPFLSFL 
KPFSFPPPPPPPLFFFLPPP 


3831 


17732 


A 


3854 


1 


236 


PTRPLRWIjRGENRLNPEGQGCGEPRSCY 
YTPAWATE * DS I S KKKKGGGRLKGP TIiT 
SGGGQGLFFLRGPQKSTSGAAF 


3832 


17733 


A 


3855 


52 


351 


TVGIMEPQTTDVYVKLHKCTDFLLPDYI 
FNKQKYPKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKRTKQKKKKREKKKKRGGGAPQ 
*LTLSPPPPPLYYFFF 


3833 


17734 


A 


3856 


72 


420 


NT YG KKKKKKG KKKKKKKRGGRE KE KLW 
EAGGGGTTRGL* KENSGGNMGKPRIKKK 
GGKARGAPLGVS S TQLL * RGEGE KCLS P 
GGGGCSEHKTTPLAYPGLGKKKETQSGK 
EKK 


3834 


17735 


A 


3857 


410 


107 


SSSSS PLARYFLGVAPGFPLPTLiFKTGP 
RIFFLGPP*KTFSLPTPGR*T*SP*KGP 
PFFFFCRDRVLLW*PGWSEMCGLKQSSC 
LCLQKCWR YRYKP PQLA 


3835 


17736 


A 


3858 


419 


239 


TTRS QLLL FLVEMGFHHAGQAGL KLLTL 
R*SAPLPKGWDYRRESLYLADA*GFYNI 1 
EDPL 


3836 


17737 


A 


3859 


262 


1 


TAAQE FNFGGP I KKI FRCQPGE VKLDP F 
KRPPFFFFFFLFFFFLFFF*RRVLPCCP 
GWS * TPEFK*STCLGLPK* WDYRHEPPN 
PAS 


3837 


17738 


A 


3860 


357 


133 


NTPPPGAGRNKENPIiFPKKKPRGVGPLF 
PPPRKAQTEEWLYPGPK*FFFFPPPPPG 
GKNTKKKPPPQKKKKKKSL 


3838 


17739 


A 


3861 


253 


375 


TIiSPQG*GGPPPPPPRGGGLEAPPPPPG 
LFFVFWGKRGVSP 


3839 


17740 


A 


3862 


3 


482 


YITDEV3MDPSLTIKSIGHQWY*TYEYTD 
YGGLI FNSYILPPLFLEPGDVRFIj 


3840 


17741 


A 


3863 


1 


129 


IiANSNYERTHSRIIILSQGLQTLLPLIA 
F* *LLASLANIAKEG 


3841 


17742 


A 


3864 


289 


372 


NHLVKKWAKDLNRHFS*EDIQIAHRHMR 


3842 


17743 


A 


3865 


266 


31 


KICFIFSP PQKKS VFFPKVRFPQKRKP P 
PALLFSAYFFFEAESSSVTQAGWQ*CN 
LGSPQPPPPGFKRFSCLSTLGG 


3843 


17744 


A 


3866 


1 


395 


NTSDFC I KPRAYNESEHHWDMVRRHL*G 
KEDNLTLD I SKLKEQ I FETS KAQLNLVS 
E TEAMVKAVDSLTNLNP VTWVKT I GISTS T 
I ANFVL I LGCLAS LLWYRY I QQLRRDS 
DQREGAMMTMAVLS KRKGG 


3844 


17745 


A 


3867 


137 


2 


APIj I KKHE CLRAVAPACNFS TLGGRGGE 
ITR*GV*DQPDQHGETP 


3845 


17746 


A 


3868 


1 


108 


ENRLN P GGRRC SEPRSCYCT PAWAT ERD 
SMS*INNK 


3846 


17747 


A 


3869 


198 


302 


LLNINHNYGGV*GCSEL*SHHCTLAWAA 
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AS LH P KKKKKKR KKKSVKS QGAF YS H YA 
MAILNYIREKKLFLKWK*LLNINHNYGG 
VLEKKKP*SYTYNLKKSQNLPQLL 


3847 


17748 


A 


3870 


237 


419 


KE I QS KQGLRAI C I PMF I AAS LMIDKG * 
RQPKCPWREE* INKLWYKHLMAYHSALK 
RNEIL 


3848 


17749 


A 


3871 


239 


232 


KAFLHTRVKRGIQPGAVAHACN-PSALGG 
RGGR I TRSGDRDNPA*HG FS WGKCLG X E 
LVGHMFNFLRNCQTVFQSGYI I YILPMP 
LDS 


3849 


17750 


A 


3872 


417 


219 


CHVAQADL KLL S S S D P PAS AS Q S TG I AG 
I SHRAWPRGTNP IHKGS TLMT * LPPQRP 
HKLTLSPRGF 


3850 


17751 


A 


3873 


45 


414 


NILFFFFFWKKSNLVPQGEGQGQSLG* W 
NPPPRGLRGFFNLTPPRTGNKGGDPPPQ 
ENFGF*RKGGVTQGGGGGSKTPALGGPP 
GLTPQRGGNNGQDPRPPAKKFGLGKKKK 
NRQLPFLGFFR 


3851 


17752 


A 


3874 


278 


2 


L.ISIYVSIIYLYHSGFQLSKISPPQGQL 
AYLSIYLSVCHFLIYEIGVITMPASKID 
K*DLIKIKSFCTAKETIIRANR*PTEWE 
KIFAIYSS 


3852 


17753 


A 


3875 


233 


379 


QCD YFNELLiRME I LGPGAVAHACNPNTL 
GGRSG* ITRSGV*DQPGQHGE 


3853 


17754 


A 


3876 


6 


247 


MlilTSASYPEDLAKXIDEGGYWKSQlFN 
VDETALN* KKTPSKTFTARDKSMPGFKA 
SKDRLTLLLGANFKLKPMNDLIIiI 


3854 


17755 


A 


3877 


411 


3 


AKI P PL TS S QGHRTRVCQ KKRERERE KE 
KEEGRKERERERKKGRKKERKEERKGS * 
VRMLFLDKVDFRSQKI ITDKEGHYIMIR 
E * MHQ * D 1 1 LNM Y I SNNRALKYMKQKLT 
E L IGR VGR PGRS GT VP GRRNGVQ 


3855 


17756 


A 


3878 


174 


375 


SFQQ*TAELEIKNTTSFTLTPPKLKYLG 
INLTKYVQDLYK 


3856 


17757 


A 


3879 


1 


106 


FHRVSRDGLNLLTS * SARLGLPKCWDYR 
REPPS PA 


3857 


17758 


A 


3880 


383 


24 


FALNNFPQGGAFSGPLTLS PQVGLFFAP 
GPPPPGGGFSCPLAQSFFRIFPQKSGPP 
ALSFSPLFKKNPWFFLRTQPLLPLFPQ 
PRVFFFPPFFFFF*DRVSLCCPG*SAVA 
QYGSLQP 


3858 


17759 


A 


3881 


400 


71 


RIRQENRLNPGGGCYSEPRSRPRTTAWV 
TG*DSISKNNNKNK*NLSFIYFYYYFLR 
WSLTLSPKL 


3859 


17760 


A 


3882 


282 


3 


I FNFCGY I VS VY I YEVQE IF* YKHAMCN 
NHIVENGVS I PS S I YSLCYKQSNYI LLV 
IIiKYAGWAQWJjTPVI PALWEAEAGGSRS 
QEMETILA 


3860 


17761 


A 


3883 


1 


175 


LYKNFFKLAEHGGVYQILRRLRQEDCLS 
PGG*GCSEFKSCHCTPAWVTQQDPVSKK 
FF 


3861 


17762 


A 


3884 


415 


233 


S QDGLDIiLT P * SARLGL P KC WH YRHE P P 
CPAANFFLKARYFWLMVARYFWLWCNYS 
ALPS 


3862 


17763 


A 


3885 


237 


367 


GPVLGPGMVARTCNPRILGGRGRWIT*G 
REFETNIiAJSnVlAKPCIj 


3863 


17764 


A 


3886 


359 


108 


TGQGGGGP P P P PLGRLKKKNGVNP E GKN 
SNKPKLAPS P P PGGKRKP PS KKKKKKKG 
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N=Asparagine, P=ProIine, 
Q=Glutamine ? R=Arginine, S=Serine, 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possibie 
nucleotide insertion 














G*HLPMLPRLGSNSWAHDMILPPYPPK 


3864 


17765 


A 


3887 


245 


377 


QFIKNISWPGAVAYACNPTTLRG*GGWI 
KRSRV*DHPDMVKPHL 


3865 


17766 


A 


3888 


406 


2 


SLNI FFFFFFFFFFFFFFFFFKPGRVFP 
LKtfFFFFPFKGCRIPFPPIFPLWLKVFF 
GPTPL*NPRG*KEGFCPPPFFFFFFFFF 
* GRVLLCR PGWNAVAQS RIiTATS AHAS 


3866 


17767 


A 


3889 


3 


114 


EFHCVS QDGLDLPTS * S AHLGLPKCWD Y 
RRKPPCPA 


3867 


17768 


A 


3890 


395 


1 


MVLIS * PCDPPTSASQSAETTGISHRAR 
QFPMLLSQ*LVCHSVTEVFLIFLLKLFY 
IYDILLYIIYNYILLIIYYLFLFLLFFE 
TGSCSVIQTGVSSTITAHYSLKSHPSGL 
KRSSHLSLPSSQDYRCAPT 


3868 


17769 


A 


3891 


2 


189 


GRVGERGGTLL* SQLLRR* ENHLNLGGK 
GCSEPRLHHCTPSWATEQDS ISKKKRS F 
IKALHM 


3869 


17770 


A 


3892 


410 


258 


GC*GCSEI*SRPCTPAWATE*EPVAQKE 
KRKKKRKE KKKHVLATLA I SSLICL 


3870 


17771 


A 


3893 


244 


412 


GIEEACGNFRVIVLKSSLALEGEEPSGR 
DS S * E AS GGRARWL TP V I PALWEAEAGG 


3871 


17772 


A 


3894 


208 


2 


RISPPITPFPPWVTKKTKG*PGFFFQKN 
* KS FFFFLRWSLALiVPQAGAQ * LDLGSL 
LPPPPGYKPFSC 


3872 


17773 


A 


3895 


407 


18 


PLLGHPPKKAGAPIjIFNLPRRPK*KPEF 
PPPPLSPTQPPPPPPPPICPAWLPPPGV 
PS S FKG* MGGPP * RPEV*GP INPFWAPS 
LPWPQNPIPFPKKKKWGEIHCMVCPLS 
FNETFSFFKMESIjRDGK 


3873 


17774 


A 


3896 


82 


401 


KGFLKNSPKGEGLFLFSFPFFFLFLATW 
LWPRLKCSGAIIARCSL*LLGSSNPSDP 
PTSTFGEAGITAAGHAQFPPPDSPPPKT 
RAHH VTQPGLKLP S * S S P PAS A 


3874 


17775 


A 


3897 


263 


396 


VWWPGGLG*Y*TPPLLGGQGGRITRSGD 
QDHPG * RGKPVSTKNT 


3875 


17776 


A 


3898 


2 


386 


KTGLE LLGLSNP PAS ASQS AG 1 1 GMSHC 
AWPNLFLYWNHHLYPHKK*VWPGTVAHA 
YNPSTLGG*GRRIA*AQEFKTSL 


3876 


17777 


A 


3899 


140 


33 


IFFFHL*GEIGFTMNMLDIGGGFTGTEF 
QLEE VNFS VND INKI I AFF VT * GVDV* S 
RIFTMAKTKPYNISLKLCS * KTF 


3877 


17778 


A 


3900 


377 


2 


TPPKENGGLPKI KVPYFRGNKFKKFPLC 
FSKVPPISTMWVLKNQFQPPIPFLKKNF 
FKT P KKGVSH YMF P PQPKS * GVF FFS KK 
KNFYPFFWEEQKKQRGPNAFLFFLKKKK 
RAAARPTRPPTRP 


3878 


17779 


A 


3901 


2 


123 


RVSQDGIiNLLTS * STCLGLLKCWDYRHE 
PPCPALRCVFYG 


3879 


17780 


A 


3902 


188 


3 


LRCTWCPGTDGWVSHLAHL* FKKGRPGA 
VAHAC S P S I LRGQ GS W I S RS GVQNQLGQ 
HVETP 


3880 


17781 


A 


3903 


422 


238 


ILHVQLPSSWDSKRKPFFFFL*R*GFTM 
LPKLLLNS*GSSDPLASACQRLGITGMS 
HYTWP 


3881 


17782 


A 


3904 


414 


1 


STHLGLPKCWDYRHEPLHPASMSYL** I 
FFIYLFTHWSVSLTARMQFAEDRDLIFL 
LHCYISSDSNSTSWL*YILVFVHSPWL 
VTTIALILGFYVFVFVFLFFFETGSRSV 
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Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














AQAGVQW*GLSSLQPSIPPGFK*FSC 


3882 


17783 


A 


3905 


396 


236 


DRVSLLSPRLECNGTISAHCNLCLLGSS 
NSPGL* NRGHQ I FS IEGQ I FLQAMS LS Q 
LLCSALWQKQPYTMYKGMEMAVCL 


3883 


17784 


A 


3906 


377 


57 


RGLVFFFQNPQA* KNLGRGRVFFPSGAQ 
KNP QKGGWAF FGRP PPR FGGR PRGP GAR 
CPLNLTPGGPKGKALGPPPWGWGSKFQK 
GVPLQWVGGNSGLFFSKKKHKL 


3884 


17785 


A 


3907 


341 


3 


TRFFFFSPLGKKGFFSKGFKLGTPRVFP 
LPPFLNPGPGIYFWGPIKKKKTFPPPGV 
KFGS F KRAP PFFFFLMFI AAL FM WKT W 
KQPRRPSVSE**NKLSYL*TTEYFFSIK 


3885 


17786 


A 


3908 


3 


162 


MMRKAI RGHL * NN PALE KLLPH IRGNVG 
FVFTKEDMAPFQKPALLQHHSQYI 


3886 


17787 


A 


3909 


1 


421 


PTRPPTRTRWGFTMLVRLLLN* KHSQAG 
AVAHACNPSTLGGQGRQ ITRTGVQEQPD 
QHGETPSRTRGRTRG 


3887 


17788 


A 


3910 


2 


424 


PRVRTSPGLALSGLTGLKFSKNQKKKGK 
KPQTTKTQKKKKS PKKKKTPKKKKGGAP 
LKKKNFNARGGENFFFF*GPKKKNPGAR 
FKKGGGGKTRGT PKKKAVGKT P FFRGGE 
KKKES PSQFYKKI KKKVFGGKKI FP 


3888 


17789 


A 


3911 


3 


236 


GFCHVGQADLELLATDDLPASAPQSAG1 
SGL.SHCAWPV* PHVMSSLKSLLATRPSL 
HRVKLQKNGLRKTRELYMKNS 


3889 


17790 


A 


3912 


304 


2 ' 


VAKKGKSKPGGLKKNPPKIGLGYPQGGP 
QNQ I WE KFFFLT P KNRWGAPL * GGGGP P 
SLPKKRGPPPPGWSNPPPPFFFFFFLRQ 
S CS VTQTGVQ WRDLE L 


3890 


17791 


A 


3913 


91 


344 


HAAPVCGRRTLKVHPHAPSWPPSSWAHP 
PPTWPEPSGTTWAPEGTYVTLGPPQELS 
LRRGVSYHCPPP*PLIWGRKWPPPLPCL 


3891 


17792 


A 


3914 


328 


402 


KSWGGWPXAVAHVCHPSTLGG*GGR 


3892 


17793 


A 


3915 


16 


179 


GGGCS ELR* RHCTS AWVTRARLS QKKE K 
KKKG P RKGE VATG S VWTRGE L KS WAL 


3893 


17794 


A 


3916 


229 


46 


AGPELLASSDPLTSASQSSRITGIPFLT 
GLYSKDHI IETANI SYTNA*AL* ITLRT 
RGSTR 


3894 


17795 


A 


3917 


42 


383 


AFPCFVIGRVISKGKECTLYFFLFLFFF 
FFFEKKFFFFSQGGKKGGEFGSMEPFPS 
GLKETPLPHLLEEGKFWAPPPPPVYFFV 
F* * KRD F TLVAR WS NRLP S RE KGLD YR 
P 


3895 


17796 


A 


3918 


402 


286 


PPPPF*TPPPPKIFGAKKKKKNSPPPPG 
ESFSF*KRPPP 


3896 


17797 


A 


3919 


349 


380 


KKSPFPQ*RN"FGSLQPPFPGLKPSSPLS 
FPSSWPR*TPPNPANLFFFKKMRISHV 
AQ 


3897 


17798 


A 


3920 


28 


327 


KTTLLNQATI IQAKAGTWGHPPPLSPRE 
NKQNQKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKNGGGGP * KKKNFSGGGGEK 
FFFFWGAKKNFGGGGF 


3898 


17799 


A 


3921 


414 


1 


NFPKVSGPKLMS PS PPHFFFPKKS FPPP 
PLFLEKKNGRFSPPFPEKKV*F*KKPPF 
FFSPKKKPRFPPKVLKVGGKKRVLKNLG 
PGIiLSKKGRGTPPPPPLFFFFFF*DGVS 
LCLiPGWSAWRSQLTVTSASQVQVQE 


3899 


17800 


A 


3922 


1 


169 


LRSCHCTPSWATE*DSVSKKKKKKTLKN 
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nucleotide deletion, \=possibie 
nucleotide insertion 














LSPQKLNRAGKIFLRVFPLEKKGQTQLL 
*NRAGKIFLRVFPLEKKGQTQLL 


3900 


17801 


A 


3923 


263 


74 


LLS TOR I PCAQ KKERKKRYECE CFL I VF 
VFDLSEENSSYISFKPXLCVCVCVCVCV 
CVCVW* VKNFRKS * *SSDRSKTKTIKKH 
SHSYLFFLSFFCAQGILQTERSHLLPHL 
EVSLFKE 


3901 


17802 


A 


3924 


3 


129 


TGFHH I GQAGLKLLTL* S ACLGLPKWWD 
YRHEPPHPAPFCD 


3902 


17803 


A 


3925 


281 


10 


NLPLPGATDYPASGGGGCNEMRSCHCTP 
AWVTGAKPHLRKK* KKKRQMLKTISKEM 
GNFNMVYILDKITK*FLYFKQLILSFLM 
GELGVK 


3903 


17804 ■ 


A 


3926 


110 


1 


QLGAVAHAOSrPS T * GGQGQAEHLRSG I Q 
DQPGKHGK 


3904 


17805 


A 


3927 


180 


368 


EINFITNYYIILFFNRQGLALSPRL*SG 
GAI ITHGTLKHLDSSEPPTSDS * VAGTS 
GTYHHA 


3905 


17806 


A 


3928 


92 


2 


VRPGAVAHACNPSTLGGGEG* ITRSGDR 
DH 


3906 


17807 


A 


3929 


2 


146 


HLNLGGGSCNELRIiHCCTPAWATE*DPA 
SKKKKSFKITSAFTCQIVML 


3907 


17808 


A 


3930 


221 


3 


KERAI WGTELNMVS KIFPI* LKQRKNL * 
PPNI KC YNNLLRP * LGPGTVAHACNP S T 
LGSRGGRITRSGDQDH 


3908 


17809 


A 


3931 


1 


102 


GGRGCSEPSSHNCTPA*VTKRDS ISKNI 
NNKIVK 


3909 


17810 


A 


3932 


351 


69 


GQGGGFRFIQPPSSRGKRFFPPPPPPIW 
G*RIGPPPRANFFFLFF*KRGFFLFGRE 
IFFFPPPGVPPPWPPQNFGFQGGAPPPP 
PPFFFFFFF 


3910 


17811 


A 


3933 


401 


94 


RGPPPLNPPLWEKNHAGPPKVGG^KPPP 
PTKKKPLFS*KTKIFPPGGPPPLIPPPR 
GGKPKKFLYPPKFRFREPKKGPFPPPRG 
T KKKT P P P KKKKKKKKVS 


3911 


17812 


A 


3934 


184 


2 


LKEFFPSRNSRSVLHTSNF*HLFI*LII 
YPFEIESYSVTLAGVQWSDLSSLQPRPP 
GLKQ 


3912 


17813 


A 


3935 


2 


393 


KIL I KVE I EGT YRKI TNAI CEKPTAN I V 
PNRKKIjKAFPLRNGTR*GCPVS P VLFISTI 
TLE VLAR VI YS E K * IKGTKIQISIATME 
NSKKFPQKTENKTTICSSNSTTGCISKR 
KEMH I S KRHLHS PVYCNT 


3913 


17814 


A 


3936 


291 


2 


FPPPPQTRGPKSSSRAHSPPGPGPGSTT 
GAR*PFRPLLLVRASLRPPLGLPTPTGP 
TGGMQRP E A* RT P ATD KPQNS RRG P * GA 
WGRPGCRCRGGA 


3914 


17815 


A 


3937 


382 


1 


QWTAQTWSVQNELLLFCLKPHPPPSLLP 
PSSCPGPHLGTPWTPAFPSHPTSTLKAH 
PIGSGPSSPLLLTSPRMPPEIQTTATG* 
PAS S LT P T VCS QHGNRF FKKG VRAGRS G 
SHLESQHFEGPEAGG 


3915 


17816 


A 


3938 


398 


2 


PPPLLLNKIGAKKKAFFCPS PFGPRKI P 
CGQKTKKSVFPFRVP *DKSTFDCFLCP I 
VYVKKQ I WLAPLHCVFSGRVAQGSKKSP 
LCLSLPFFFFEMESHPVTQAGVQWRDFR 
SPQPPPPGPKPFSCLGLPSS 


3916 


17817 


A 


3939 


240 


385 


KLLFLFLF I YFF * DGVSLCCPGWS AWQ 
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nucleotide deletion, V=possible 
nucleotide insertion 














TLLTE PSAI PGAGGTP VS P I 


3917 


17818 


A 


3940 


230 


381 


LISS S YRE YLAKDLRFS S KQS * WLCS KE 
FRGLGAVAHVCNP S TLGGQGRW 


3918 


17819 


A 


3941 


1 


107 


CWPWS*TPDLKQSTCLGLPKCWDYRQA 
QPHRNLL 


3919 


17820 


A 


3942 


2 


95 


DLLTS*SACLGLPKCWDYRRKPPRPAIiP 
AFS 


3920 


17821 


A 


3943 


163 


404 


CRHKAQWLPQLPGHLLLSEKTLKYSSPK 
GLRSQH*KLLFLFCFG*TESRSVTQAGV 
QWRNLGSLQPLAAGSKRFSCLSLL 


3921 


17822 


A 


3944 


3 


200 


GLDLLTS * SAHLGLPKCWDYRREPPRLA 
YFTFI KKKKKDGCELLNLFPNS CMDTAH 
VHPAHLTGG 


3922 


17823 


A 


3945 


423 


2 


LFFFFSEV*SHSVAQAGVQWYNLGSLHS 
S 


3923 


17824 


A 


3946 


209 


413 


TTCIEQCNKTRRLGAVAYACNPSTLRG* 
GRWITRSGVRDRPDQHGETTKNTNISTT 
KKS TKNTKI S QA 


3924 


17825 


A 


3947 


425 


3 


SPSQGLRDKTPISLGQSTWGKGQMWAQR 
QQT*TSWPDGSEESSRSPSTFEQQISER 
SSSAKGQTNSSSGSLNSVSPDWETPPSR 
SQQTPHTGELQLASGRCSSGMNLPEEGT 
DSNLCCSATHAS 


3925 


17826 


A 


3948 


299 


3 


ASQTTFREGCFLFFSFFISVCLFVCLFE 
MEFHS CRPGWME CNGDRARLHLKTTITT 
TIEIVKDDTNKWKHIH*RLPNIKMSILP 
KVIDILFNAIPIKI 


3926 


17827 


A 


3949 


331 


374 


MAP KS E RAS KAF * A I LEENKE KL PLD I N 
PWHPHGHIFKCVAPPHCLGWDLSAVPA 
S HALG F PQ PQNS AWHVARS KS AASCLGT 
AVTSAGSGVLGTPNKMPQKVRERARLFW 
R 


3927 


17828 


A 


3950 


439 


69 


TNQKWVPPKEKGLFKKKQLRPRVGGPA 
FHSPPLEGQGS * FP *GQRFKPRPPPOTK 
PLFSKNPKNYPGGGAPPFFPPSPGG*GG 
KFPPTPKAKVPFIQISPPPAQAGGPTQV 
PFPKKKKKKSH 


3928 


17829 


A 


3951 


422 


3 


CFPALG I G P PH P P KRVG* KGGT P P PGL F 
FGKFLKEKNFGKNPGPGQGGPFFPPKKQ 
RGP FNNGR P GKVKKKGAPG P GVP KKKP P 
RVPPPPGFFGNPPGAPPKFFFSPRGGED 
QKRKNFFFFFFLRWSFTNEAQAGVQWR 


3929 


17830 


A 


3952 


223 


3 


ALiFFWGRGTLiPRVKGLjNPKGGGGKFFFF 
FFF*ETESHSVAQAGLQWHDLG*LQPPP 
SRFKPFSCLRLPGSSLS 


3930 


17831 


A 


3953 


1 


388 


E KGVS PGGRTCS EQRWCHCT P VWATERD 
SVSKKKGMSNNFSLILRSWNNKGQGGTY 
FQQKFI YPCFLKI KGHS YINCKI FKK* * 
NLSLHVHVTDVNVR I ILQKLDCQIHWVD 
FNFELN I LQCQGLGGCK 


3931 


17832 


A 


3954 


1 | 


396 


KLRPKDQDTWPTQQSAIWEFVPPSVYF 
VKS ISE I LFFFFLKKNFFLGPQLKGRGQ 
ILVKGNPCPGGKKKFPAPPPQRGGKKGG 
PPPPQF I FVFLKKTGV* KIGPGGV* TPA 
LG I AP P F F P KGGE * RGGP P P 


3932 


17833 


A 


3955 


2 


199 


PGGGGCGEPKTRGCTPAWATERDSVSKI 
NK* KLKLKNNS WL YAKS LAS S PTGR* EE 
WGCFTNEHI 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F==PhenyIalanine, 
G=Gfycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=ProIine, 
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X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 


3933 


17834 


A 


3956 


393 


2 


KEFPLFPPGAPPWPIiSGPFSVLEKPKCG 
GGWPGINTPPGGGGFPQGSLWAGDFP* 
PLFSPQIPFFKKQKFFFFLPRGGLWAPF 
FPPFGGPL.FSQNF*KTGF*GGFFFFFFG 
DGVS LC YPGWS AVAPHAS 


3934 


17835 


A 


3957 


278 


390 


L VY I F K I RT C W PG AVAHACN P GTLGG * G 
GRVTRSGDQ 


3935 


17836 


A 


3958 


385 


311 


G PKWP PAS AS Q EGE TTE V * HHAWL I FFL 
FFVELWSSHVAQAGLKLPASSDQS*TLY 
I 


3936 


17837 


A 


3959 


410 


119 


SPPPRFKTPPPEFIFGAPKKKFFSPPPR 
PLNLILLKGPPLFFFFLPKAFKPHINFV 
FPPQKKKJW*INLYFPPPKKENESS*GK 
HFALHCNPSCISK 


3937 


17838 


A 


3960 


77 


400 


TALPLKPKKKKKKKKKKKKRGGPLKKKK 
FKAPGEEKKNFFKGAPKKKFRGRV* KTG 
ERKKPGGNKKKTFGKKPPFPRGEKKKKP 
QA 


3938 


17839 


A 


3961 


2 


194 


CL* SQLLGRLRWEDHLSSGN* GCSEP * S 
PHCTPAWVTE *DPVSRKRKKYPVVNQTL 
RGFLSPAH 


3939 


17840 


A 


3962 


52 


409 


NSKTSSQEKKRRVIRLSLRSRVGDDTKV 
NTRRWGVFGWHLKNLPTTLVLKKMEKIL 
ETYFMTPDLHDPHTTS YFNGCQLQAI KV 
GGTITDLIVF* I VS KPNAAGL S LNENVN 
AKFNDLI 


3940 


17S41 


A 


3963 


382 


96 


LS SQLLREAGERNE * TGWLQQPLQMQ IH 
KNPCIEPRSRYCTPAWATEQDPVSKNKQ 
TNKQTKNSLHL 


3941 


17842 


A 


3965 


3 


169 


DAW E LRS HR C P PAW * QS E T P S Q KKKKRG 
KKKI FRFYLFFNKRKKFLNGGNLGVI P 


3942 


17843 


A 


3966 


1 


121 


AGFHRVSSDGLDLIiTS*SARLGLPKCWD 
YWCEPPHIiAENS 


3943 


17844 


A 


3967 


200 


2 


PPFRIYWGEVLTPLFLRVKPKFFKPWGF 
PFCLFFFFFFEMESRSVAQA*MQWRDLD 
SLHPPPPAFK 


3944 


17845 


A 


3968 


471 


440 


MELRFYHVGQTGLKLLTS * APPVMASQS 
AG 1 LG VS HS T W * E APNS LAI T FL S S HL S 
PRQP*THLGC 


3945 


17846 


A 


3969 


214 


2 


EGQ PE CQLQAGDW VS LWNLVCSGLE VRA 
WREAPRLEWSFCLFFVSEM* SHSVI * DR 
VQWHNIGSLQPLPP 


3946 


17S47 


A 


3970 


53 


398 


GDLPKKQQQQKPLNFCFQLLLYARLFGP 
I FLV* HliATYNNKHLLECVGKVM INCQA 
DILNKME I SKVGAQKKKKKKKKKKKRGG 
PLKKKKFI PRGGEENFFFLGAPKKKTGG 
GFK 


3947 


17848 


A 


3971 


396 


160 


KRE R I KE KEDNKE E KD KE KKAS GKKE RK 
KEKEIRDRESKEESSNLLLQEELSLCCP 
SPGHPPNSSL*DPYFATGNSLRC 


3948 


17849 


A 


3972 


416 


3 


FPFPRVPEGVPGGEGVFPPGFFGKSGVP 
FPRPRGGGAP I * KPPPFPGGPPNFFF * K 
KTGPPFSPPQGAPPAKRAPPPFFGGGGG 
KIFTPLKKGGLFFFGPPKKGVFFGPKKT 
L FF FQ KKKKKKKGRP PRSHAS AHAS 


3949 


17850 


A 


3973 


232 


381 


YKTFSLIF*LFFFRDRVSFCHPDRSAW 
IiS*FTVASISWAQGILLAQPLE 


3950 


17851 


A 


3974 


396 


141 


CFFPPPPKRGFFPTPIFGGPPGFFFPPF 



792 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
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F*NP PPKI FFGAPKKKKI FP PPGGKNFF 
FLKGPPPFFFFFFFFFFFFFFFFFFEDG 


3951 


17852 


A 


3975 


286 


422 


RKQL I S PHSQDNS KQKEQS WRHHATCLQ 
TILQGYSNQNS * * AP IAKTILS KKNKAG 
GITLPVFKLYYKATVTKTAWYWDQK 


3952 


17853 


A 


3976 


3 


116 


GFHRVGQNGFDLLTS * SACLGLPKCWD Y 
RHEPPRPAR 


3953 


17854 


A 


3977 


223 


399 


EGPRTQS FFDTE VHS VTQDGVRRRNLGS 
LLPPPPGIK*FFYLSLPSNWDYRRLPPR 
PVN 


3954 


17855 


A 


3978 


227 


422 


SFPSPCIFFSLVDLGVSL'YCLGWSQTLD 
LKGSSLFSFPKCWDYRQ*ATTPGLPVYF 
FPFPVFTFF 


3955 


17856 


A 


3979 


519 


313 


KKNLLNPGDGGCRE IKSSHCAPAWVTEK 
DCLKKKKKKTSVCVCVCVCVCVCV*NHL 
SSLGLFPEPVGLN 


3956 


17857 


A 


3980 


217 


414 


IYIFCGDGVSLCCPGWLKQSFCLSLjPKC 
*DYRHVPPRPTSFHLLTNIGVLQFLKLQ 
ATLSFTLFLL 


3957 


17858 


A 


3981 


2 


409 


KLTFAV* D Y * EKLLLSnAS LTS QAAFLF 
PEHPYTVISNPCSPANYIFTLSSGTTFH 
QFYAKS * CVTYE * KKKNLLS PPTGDKPN 
*GRSQTKKIQTKY* SEICFFVHLFETGS 
HHVAQAGLELLSSRDPPTSASQST 


3958 


17859 


A 


3982 


234 


416 


S VL P S T P L LNKL * KS KKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKSRGGAPKKK 


3959 


17860 


A 


3983 


414 


291 


GGRACSEPRSCPCTPAWATERDSV*KNK 
QPNKQKNKKDLL 


3960 


17861 


A 


3984 


192 


415 


TNANKLDNLDEMDRFLETQNLLRLNNEE 
TE YQNR P VTRE E IE* VI KNLIj TEKKNPG 
PDGFTGEFYQTFKEKLIP 


3961 


17862 


A 


3985 


403 


145 


AFFFFFFPPGEKGDFSPTVLFGGPPGFS 
PPPVFKTRPRNFFLGPPKKKYSFPPPGP 
KNWFP*KGPPPFFFFFFFFFFWKLGIEV 
IS 


3962 


17863 


A 


3986 


255 


425 


GGRFKGSNFTSAGLQGFXXXXGPPKFI S 
RPGV*QRGEWKNPGVPNLFALSTSPFGR 
W 


3963 


17864 


A 


3987 


247 


2 


E E AVRVS NL» YT KLD E E Q * VVKKQLN Y VE 
RLKK* ELFLQDLYRKVPLKI * I F* VKNL 
NS F WPDAVAHACNP S TLGGRGGW I T 


3964 


17865 


A 


3988 


1 


394 


HHHTWI IFKFFCKDGVSLCCLSWARTPG 
LKRSSCFRLTKCWDYRHEQLCPAFKHFF 
LY*HNFMSTEKMQE*HKNS*YPSPTVPK 
GVTFYYISYICFIILILSAITYKFVHTY 
IVSLNHVCYSNEAPLSLWI 


3965 


17866 


A 


3989 


208 


402 


E TDRRIiKS GGS L S KTIi I KCMT C FWFRD 
GVSLCCPGWIQTPGLK* SSCIS LPS SWD 
YRHMPPCPA 


3966 


17867 


A 


3990 


2 


408 


THRPYHAT P TYLP F YTNNL I KGGKKKKK 
KGGKKKKKGGPPFKKKHFSPPGGEKNFF 
F YGPQKKKSRGRV* KRGEGKNPGAPQNK 
PLGKNPLFPGGEKKKKPRFAS*PPPPPF 
S FGKKRAP PTKYPGAGLHP 


3967 


17868 


A 


3991 


81 


406 


KGGLFLLPRGKGKGKIWVKGNLGLKNKK 
MFPA* PSEEGEKKGGPHQAGKILKFGGK 
RGLNKGGKTGQKLKPGGI PPPGP * KVGE 
KKEKPPAPGQKLFFKKRKGQKKNF 
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3968 


17869 


A 


3992 


396 


64 


YSSPPPTKKTDDSDASRGPFFFFPPPEK 
RGFFPNDFFWVPPGFFPPPGF*TPPPVF 
FFWAP*KKIFFPPPRE*KFFFLKGPPPF 
FFFFFFFFFEHLQVLLIC 


3969 


17870 


A 


3993 


367 


193 


KLWEKKKILWKKG*GEIKSCHCTPAWAT 
RVKL CLTKKKKKKKKKGGEGE I FKKKS F 
S FLGKQ KKKRGPAF F 


3970 


17871 


A 


3994 


188 


2 


KRANTLPDFKTYYKAIVLKTLWYWRKDK 
HRSMKQN*ESKEVELLYHNIHRNKLKMD 
HRLKCQ 


3971 


17872 


A 


3995 


2 


147 


QENHLNPGG* GCS E LRS *HYI PAWATE * 
DSVSKKNYI * *TSHI FSGDF 


3972 


17873 


A 


3996 


23 


329 


RNS F I RS AFSNVLNKVVLFLTS E KKKKK 
KKKKKKKKGGRKREYLTRRGAGENNKYF 
YK* KKKKNTCG* KKKGWGRKREREKKRL 
GGKKKFPPFEEEGKKKPG 


3973 


17874 


A 


3997 


164 


392 


KPLGRAGLVPCLCSSFTGFLM*KTEPCS 
WQAGVQRGHLCSLQPQSPMFKRFSLLS 
LPSSWDYRVLLRHPSWSAW 


3974 


17875 


A 


3998 


396 


3 


FFFFFPPGGKWGISDSPQLFFSGVKNFS 
PPPPPVNWGLRVWPPPQKNFFFFF*KKV 
FSFFWGGPPNPPFFFSKLLGGPPPPPIF 
SGEKIPPSLFFFFFFFEMESCSVAQAGV 
R*RHPGPLQPPPPWLKQF 


3975 


17876 


A 


3999 


383 


2 


VPSPEKVGIKKIQPLDPFF*FLGDCQRK 
KKSPS*RPPPPPEGKKLN*IRLMGPPFN 
SKEKFF*KF*AFKPFFFPPKGEGI*GFF 
SRAHSFPNQRALFWFFFFEMESHS I TQA 
SMQWHDLRSLQPPPL 


3976 


17877 


A 


4000 


213 


1 


RGRQNKTLFLTKPNQTKPNQTKPNQTKP 
TT FL * S VI E I RVKKKYKTTRKTTKNPTT 
SLTRSLGRYKSPTRP 


3977 


17878 


A 


4001 


412 


129 


GSLSSPRLECSGTIPAHCSLNLPGSSSP 
CTSAPRVAGPE*MGQACMPPCLPVNFFF 
WGGGG I LrVE TRS C * VAQADLELL S FS D P 
PASASKVLEL 


3978 


17879 


A 


4002 


411 


293 


RDTRCLPP * LDNFVFLVEMGFHHVGQAG 
XiELLTS S EE I A 


3979 


17880 


A 


4003 


46 


392 


QDFFFFFFFLGKGAPPPPGKKGGTPFPK 
KKTKGGGPRGPPLEPPPLGGPGPPEPGG 
GDIiKKP WPPQGTPPL * KKPKMGGGGGGP 
P*TPGTGGGGEKNPLTPGGGPKNGPPPP 
PPG 


3980 


17881 


A 


4004 


360 


1 


KCLGFQFFFRFWNICI IFISQLS XASQV 
QKSEIQNAPMSIFFEHHVSFKQFQILEH 
FGFWI F I LGMLST YKTL I LLKLN* RFNA 
MPlNIPWIFMEIDKLrLKFTWKYKEPR 
RVKAILRN 


3981 


17882 


A 


4005 


152 


408 


NKFVSHFYWMGNVLSRYPQFT*QFSFCH 
VRLPS VF * YL YTRNKI L I FFAHKAFFIjD 
AKYVCVCVCVCVCVCVCVYFGIGEGNLP 
S 


3982 


17883 


A 


4006 


411 


67 


YLPPPSSPTPGVFFFFPPPKKEFFPNPF 
ILGTPRFFPFPPF*KPPPKFFFLGPKKK 
K* FSPPRGKKI FFFKRAPPFFFFFFFFF 
FFFFLLGGSLS FMYFHIKTYK 


3983 


17884 


A 


4007 


217 


2 


PLFFFFKHLKAYYS DVIHI S FAI LRYLL 
F * DRVSL CHSGWSAVARS WLPVAGQNTF 
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ITRQEAPYLLAIPRV 


3984 


17885 


A 


4008 


235 


406 


GRHRVS SNP I S I * RRRGL S LL PKLE C S G 
VS IAHCSFDLPG* SDPSASASRLLGRLR 
Q 


3985 


17886 


A 


4009 


833 


237 


SLDNRARPCLVSFIFKKGRSTAERCGAP 
GTTSPPFPRLSERGHPIIPS *RGWERKS 
QLCSHLQSSRDPSGLGRAGCGGGGDLRE 
S PGTFPGNGGSDLVP VPTAP I PIEPQ I C 
QATCAVMKLSFDEEYRRAMNELGECPRS 
SGKPSSSPRTGGGSSAAKGTQAGSSELG 
LGIGARRGTDSLESHSTSHEWTVTLGPL 
SP 


3986 


17887 


A 


4010 


276 


2 


VE VKGNNS FSEGS ITL I PNPDKQPKETT 
DQPLMSTYAKXESPGKNPSHDG*LIFDK 
GAKI IQ*VKNS I FNKWCWDNWDFHVKTM 
NPHFTLI 


3987 


17888 


A 


4011 


188 


27 


REKFQ I YTI KPJDKGS I TTDTTER*KIIR 
DYYEHLYAHKLENLEEMHKLIGIPP 


3988 


17889 


A 


4012 


1 . 


202 


CLHQKVIISN*L*ATTQTTQLSLSFKLD 
YFS 1 1 1 IPVALWTWAI I KFSL* YINS P 
PKKKEAEKDLN 


3989 


17890 


A 


4013 


1 


403 


CLDQEVIISN*H*ATTQTTQIiSLSFKIjD 
YFSIIFIPVALFVTWSIIEFSL*YINSP 
PKKKKKKKKKKKKKKRGGAPL KNSRGGP 
NFWGGGKNI FFFFGGGDKKPPRAFWKKP 
LFLGGGNLGPPPPQKFTPWGKK 


3990 


17891 


A 


4014 


67 


439 


TELIIFYIFFETTHjIPTIjAIITR*GNQP 
ERLNAGTYFLFYTLVGSLPLLIALIYTH 
NTLGSLNILLLTLTAQELSNS *ANN1,I * 
LAYT I AFMRKKTKKKKGGGLLKDPWGGQ 
ILAGREKIKFFP 


3991 


17892 


A 


4015 


3 


260 


LIVPTIILLPLT* LSKKHI 1*1 INTTTHS 
LIISIIPLLFFNQINNNIiFSGSPTFSSD 














F 


3992 


17893 


A 


4016 


258 


3 


LKKIFSPPTINLFFPPFPLKKFFFPLSL 
* F FLG VFS P FFP P P KKGFF P K I P P G VF F 
SPPFKKKKFFFPPPFYFAPPRVFFKGPP 


3993 


17894 


A 


4017 


287 


2 


KKQKNFIFFFKNLSFFGRGAFFPPLFPP 
ILRGKGGDFFYPRGLKPALGNQ*NPFSR 
F* IFFFFFFFLRRSLAPVTQARVQWHDP 
GSLQPSPSGFK 


3994 


17895 


A 


4018 


228 


487 


SYDVKQEKPNGI * AS KY WAY VFHS F FL S 
FFFFFFKKKNFFFCPQGGGQGPNLSLRE 
PSPPGVKKLFGLNIiSKSWE*QNCPPPPI 
IF 


3995 


17896 


A 


4019 


35 


228 


EELNHLNQGGEGCSEQRSHHCTPAWATE 
* D S S QKKNKFW Y I H P CNR I VT T TNT PMT 
ESPKHYAE 


3996 


17897 


A 


4020 


2 


402 


ARGNLNTTFFDPAGGGDP ILYQHLF * FF 
GHPEVYILILPGFGIISHIGTYYCGKKE 
PFGYIGMG*AMISIGFLGFIV*AHHIFT 
VGIDVDTRAYFTSATIIIAIPTGV*VFS 
* LATLHGSNMK * S AAVL * ALG 


3997 


17898 


A 


4021 


1 


420 


TRGATELI ILDI L VETTL I PTLAI ITR * 
GGQP * RLNAGT YFLF YTLVGS LS LLMGL 
IYTHKTLG* VNI LLLTLTAQEL*NS * AN 
NLI * IAYTIAFIGKIPFYGLQIi*LjPKAH 



795 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A1anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
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VEAP I AGS IGLAAGLDKLSGYG I IRLTL 


3998 


17899 


A 


4022 


409 


3 


LEMAS CL 1 1 CNNNNP FF YQ I VT CDE KW I 
LYENW * LPAQWLH *EEASKHS PRPNLHE 
KKIMVPVW* SADGL I HYS FLNP CET ITS 
E*YAQQINVMH*KLQGLQLTFINRKGPI 
FLHNNTKLHVAQPMLQKLNELA 


3999 


17900 


A 


4023 


464 


374 


HLP P WED KARRHL PDART F I LAFLASRN 
VRNKFFSLQATQFLVLYASSTKITKPLK 
TNQKKRLKKSEQSLRNMWTPIKRTNICI 
VGIIEGKEGEKGAERIFEEIVPQTF*NL 
VKDIK*KVQ 


4000 


17901 


A 


4024 


58 


323 


RHKLLCEHVKKCSAVLIIREMQTKTTLK 
YHFLLVT I AKI FNL IHF *RSVGE TL I CQ 
WAGRNAKS * P FWRR I SQ YLTKLYVCVCM 
YTHS 


4001 


17902 


A 


4025 


480 


150 


RNSLLRRLRQEDGMSFGCQGCSEL*LHH 
CTPAWVTEQDPVS KKKKKKKAYPDP I P K 
KTDKSNGKKGPDFSRKNP PKNS P PP PWQ 
RSTAFRGRAAKSCPTH 


4002 


17903 


A 


4026 


204 


1 


KKKFGALFFFFMQGLTLLPRLEYR*WCD 
HGSL*P*PPGSSDPPTSVSQIAGTMGVH 
HEFLRSGWLSK 


4003 


17904 


A 


4027 


116 


472 


MPPLPKPVSGPRPHSCSPLGGPQLSTPL 
TGPRPASLSLPISEGICGCITAILWAVW 
YLVLQMLGLSLFMA* RGAGS s csahkkw 
wlglas p s c * t F F KNRTKGRARW tiT P VI 
PALWEAE 


4004 


17905 


A 


4028 


355 


444 


VS YKKK* S RPGAVAHACNP S TLGG* GGR 
IT 


4005 


17906 


A 


4029 


410 


181 


ASIILL I AVFFNN I LSGQ * T I TNTTNQ Y 
SSLIIIMAIAIKLGIAPFHF*VPEVTQG 
TPLTSGLLLLT * QKLAPCF 


4006 


17907 


A 


4030 


49 


474 


P XRXXF CH I VT Y Y S GKKE P FG Y I GM V * G 
MISIGFIiGFIV*AHHIFTVGIDVDTRAY 
FTSAT 1 1 1 AI PTGVKVFS * LATLHGSNM 
K*SAAVL*ALGFIFLFTVGGLTGIVLiXN 
S SLD I VLHDT Y YVXAHFHYVLS I GAVFA 
II 


4007 


17908 


A 


4031 


274 


427 


FFF*EMESHSVTQAGLLGRNHSLiLVPQT 
PGLKRSSLLSLLSNQYYRSGPHT 


4008 


17909 


A 


4032 


205 


393 


TWGGFKNPSPGEG* IKKMHDFNEMFLQN 
CWKI LFFGGGFFTGQI FPRAAPP INKNI 
LVWGGGK 


4009 


17910 


A 


4033 


2 


386 


SEPLSRKKLYLSIIiISLQISLIITFTAT 
EDI I FYI FFETTLI PTLAI I TR * GNQPE 
RDNAGT YFL FYTLVGSIiPLI» IAL I YTHN 
TLGSLNILLLTLTAQELKKKKKKKKKKK 
KNIGGPPLRAPSGAPN 


4010 


17911 


A 


4034 


3 


169 


GLFLAMHYS PDASTAFS S I AH I TRDVNY 
G * 1 1 R YIjHANGAS I FFI CLFLHI GRGL 


4011 


17912 


A 


4035 


3 


317 


HAS ADAWAF I G YVL P * GQ I S F * GAT V I T 
NLLSAI PYIGTDLVQ* I *GGYSVDSPTL 
TRFFTFHFILPFI I AALAALHLL FLHE T 
GSNNPLGITSQKKKKKGGPF 


4012 


17913 


A 


4036 


446 


194 


KKKNQPGGP PPPPAFP VGGL I KKRFPPT 
PGKKFKFR * KFFPPKGGFFVRDPPPKPG 
CWEREKKKKPLLWTDKWVILEGS EAGLF 


4013 


17914 


A 


4037 


213 


1 


GMAKVKARFLPQKRGQRVLNPPWWGKGF 
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PNSGTKETPFFFF*DRVLLCQPGWSAVA 
Q S * LTTAS TF WAQA I 


4014 


17915 


A 


4038 


307 


36 


RKGFFFSKCGEFKKLPGPPLGGWYLKLI 
FCQKKFFHPKFPGFPG* IFLPP PKIFFI 
FSFEMEFCSCCPGWSAMGRPRLTATSAC 
WAQANS 


4015 


17916 


A 


4039 


2 


372 


NDTS LTVKS IGHQWE * TYE YTD YRGLVF 
NC Y ILPPLFLEP GDLRLLD VDNRVI L P I 
EAP I R I INTSQD VLHS * AVP TLC * KTDA 
I PGRLNQTTFTGTRPEVYYGQCSEICGA 
HRRSMLIVLELI 


4016 


17917 


A 


4040 


107 


279 


LLLLF FFFWRKSLAIoAPRWE CRGKI *GH 
CKLRLPSPCHSPA*ASPEAGTTGWTYGC 
F 


4017 


17918 


A 


4041 


3 


351 


GSHYIAQAGLKNPGSSHLPASAAHSAGI 
TGVSHHAWLKIFFKESVSDYSPHLNSRK 
NAVYVS FFFFWKKS FVFVPQAGGKGLNL 
GSLKFSPLRLKQFSCIiTLPRS *EYGLAP 
PPPV 


4018 


17919 


A 


4042 


1 


385 


RPRRPD IE* QERRTQE VLQAVAKKVKE E 
SQLPGTGGPKNVLQPVPRAKAERPKKQA 
EASGL KKE TD WIiKVDAQE AKTE P FTQG 
KGGGQTTPESFEKAPQVTE S I * S SELVT 
TCQAETLAGVKSQEMV 


4019 


17920 


A 


4043 


3 


282 


TAILIQTP*SFTGAVILIIAHGLTSSLL 
FCLANSNYERTHSRI I ILSQGLQTLLPL 
IAF* ^LLASLANLALPPTINLLGELSVL 
VTTFSVDAA 


4020 


17921 


A 


4044 


2 


407 


GTRLE IMSRNHGI FPFTLE I FKDNEFEE 
P YREALP TLKLRDS L YFGIE PEEHVSG * 
ESLEESCFATPTSKIDEVLKYYLIRDG* 
VSDDSVKQYTSRDHIxAKHFQVPDFKFEG 
KDH KE V I LH * RDL VCG VMDE R S R 


4021 


17922 


A 


4045 


4 


380 


ENWLTI IRQAWHEADRNLNTTFFDPAGG 
GDP ILYQHLF* FLGHPEGYI L I LPGFG I 
ISHIVTYYSGKKEPFGYIGMG*AMISIG 
FLGFIV*AHHIFTVGIDVDTRAYFTSAT 
1 1 IAIPTGVKVFS * LATLHGSNMK*S AT 
IIIAIPTGVKVFS 


4022 


17923 


A 


4046 


2 


94 


ADRNLNTTFFYPAGGGDPILYQHIjF*FF 
GHPEGYI L I LPGFG I I SHIAT Y YSGKKE 
PFGYIGMV*AMISIGFLGLIV*AHHIFT 
VGIDVDTRAYFTSATINIAIPTGVKVFS 
* LiATLHGSNMK* SIPTPILI FRP P 


4023 


17924 


A 


4047 


376 


138 


LLARQRQENGVNPGGGACGEPRSRHCTP 
AWATE * DS VFLLWREALLTQTAP FNRLQ 
LLALPSEFFHFRILIVLQPHHSQ 


4024 


17925 


A 


4048 


145 


226 


E P I Q AC S S RRVCE P I Q ACS S RRVCVCVT 
PS RHALPGCVCV* E P I QACS SRRVCVCV 
RAHPGMLFPACVCV* EP IQACS SRYVCV 
RAHPGMLFPACV*AHPGMLFPACVCVCD 
P IQACS SRLCVCV* ERAHPRLLF * ACVC 
VCESPSRHALPGVCVCVRAHPGMLFPVC 
VCESPSRHALPGVCVSPSRHALPGVCVC 
V 


4025 


17926 


A 


4049 


2 


414 


ADRNLNTTFFDPA*GVDPILYQHLL*FF 
GHPEVYILILPGLGI ISHIVTYYSGKKE 
PFGYIGMV*AMISIGFL,RVIV*AHHIFT 
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VGIDVDTRAYFTSATI I IAIPTGVKVFS 
* LATLHGSNMK* SAAVL* TLGFI FL 


4026 


17927 


A 


4050 


472 


228 


LTRDGGACPFSKLFRRVRQKNPLVLGE * 
SCSEPKLCPCPPAKKTKDYPVSKKKKKF 
I W I INRTKI PRKRKKNKAYHKVKYF 


4027 


17928 


A 


4051 


238 


3 


RRHGSFAPLPPGLPLCISYIWLFITILC 
NILYLYFETGSRYVAQVGLKLLCSSEPP 
PSTS * STEITGMSHGARPHNI P 


4028 


17929 


A 


4052 


225 


172 


IARYPFHIQSCTNMPSPNKSITSKEEDN 
GPGAVAHACNP S TLGGQ GGQ I TKSGVRD 
HPG*HGLV 


4029 


17930 


A 


4053 


471 


30 


GPGGIKISGALFPLNKVKRVPWFPRPP 
LIFPQKGVFNQNGGDPGGGFWPGETPGR 
VPPPPGVFFVKSPFLSLPFPPPPV*KTA 
VKTGFPYRPCGVPPRGLVFPKKKKSVLN 
RPDLSMGPQPE*SLPLPGEPSVSHAGCL 
PQAWRG 


4030 


17931 


A 


4054 


425 


41 


NWGSGTLCPQKLGVFPPPPPGEGGVPGA 
PPPARLFFFFF*KKGFSPVGRGGFKFLP 
PKNPPPLPPQKVGIPEGGPRAGPIFSPP 
QLFFFFLTWSLTPSPRLECSGAILAHCT 
PAWETEQDSISKQTKK 


4031 


17932 


A 


4055 


312 


1 


I KNARAHLLPQGS PPPL I P I WI IET I S 
LLIQPIALTVRLiAANITAGHLIMHLIGS 
AALAISTXILP * TRIMFTI I IERTILEI 
AVAL I QAYVFTL WS LYLHA 


4032 


17933 


A 


4056 


648 


123 


D PDP VPSARGCVP S GAPGRGAQLKKAWD 
GALA* LPPCLCAQELPSFS LGTGGPAVG 
L.SWSERDGRLGEVS INAEFTVAAFLKG 
TQSRRGINGMPLPQREPSSRYCLGLGGE 
SSAQGTSGLTGQRV*QNSDLPELASGDD 
KGQRSSQGGAVSQSSPRGRQIPPSSPPA 
ALFNFL 


4033 


17934 


A 


4057 


135 


464 


QHNSRLMQQAKKGVTVLARVIDLDYEDE 
ISLLLHNGGKEE*AWNTGEPLGCLLVLP 
CPVINVNGKLQRHNPGRTTNGPDTSGMK 
VWVT PAGTKPQPAE VLAEGKENKECG 


4034 


17935 


A 


4058 


3 


575 


LRSRPLHIILSCGLLVTPRSLSPPPPPQ 
RLRLCRPSRGAAEFFFSLRTKLHFATLP 
LRIEGS KTL*NCC * LYHGC * IS* IHHGG 
LFDVAAKNI IHEE VEKYDKQYRGKELLG 
FAT Y KT FE 1 1 VHQ Y I Q * L VE PAL S MLQ K 
AME T I Q QAF INVANKHFGE F FMLNQTVQ 
STIEDIKVKHTAQAEYMIQLQVR 


4035 


17936 


A 


4059 


389 


3 


TFKGPKVFRYPTFFLCPPPRGFFSKNIW 
TPPPPPFFGVFPEREKIFFFLKGALPLF 
SFRVTPFFGLKRGFFLSPTPPFFFPPFF 
WPPPFFFFFFFFFPLRRSFALVTQAGVQ 
WRDLG* LQPLPPGFKR 


4036 


17937 


A 


4060 


397 


1 


KRRGNFFFKKGISLFI*TPPRGPLWVFF 
F*KAPKGFCAKFFPFFYPPLPKENGKTH 
RQ KT P F PHF CQ I F KNGF WVLHAF S LGD F 
PARVWGPPFKRFFKGIFKEFPPYGSPPP 
KGPKKKKRTAARD LELADAW 


4037 


17938 


A 


4061 


38 


410 


AWIS I * ERPFTLVRVFKNFLAI SGLLFS 
YFRINFLYVQIHIHTFFRKLFVLSYVFR 
FTDVKFFFYILPFFPPHFLFGWICSYAA 
L*FFIYFCFLPFFFNGPTSP*IGLFLLI 
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FFFKKPTFGFGG 


4038 


17939 


A 


4062 


405 


81 


GGANVFPQKS PP FGNKKGGHFFSYKI LN 
NP GH P P FLKKKKNAP GGWGNP WF S P P W K 
GAL* PKKKNPPKKKKKSPPPKGGKKKNP 
FPPKKKKKKKKA* GDNFDMGTGN 


4039 


17940 


A 


4063 


415 


239 


RPSFFFFPPPKKSCFSTFFFFLSPGFFF 
PPPFFFTPPPFFFFFPPKKKKYFFPPPG 
KKFFFF*PPPPSLFFFFFFFFFVFFFFF 
FFFFFFFFFFFFF*GLTSPHKQIGLVLiA 
FLLAbS KITHAS I PVPVS S P S KKKKKKK 
KKKKKKKKKKK 


4040 


17941 


A 


4064 


95 


241 


SPCRSPHRWVNS*ANNLI*LAYTIAFIV 
KIP L YGLHL * L P KAHVE AP I AGS I VIiAA 
VLLKLGGYGIIRIiTLILNPLTKHIAYPF 
LVLSL 


4041 


17942 


A 


4065 


28 


288 


YFD I FVEARS PYVFQVGLELLGSSNP PA 
SASQSVGITGGSHCAQP*VTILFTTCTL 
HWG*I*CIKFLIHNIPILVISLQYIFG 
FFL 


4042 


17943 


A 


4066 


338 


134 


PGGGCSEPRSGHYSHCTPAWQ * SETPS K 
KNKKKKITYFSVKNWGIRVFIEIILNSF 
TTVRP 


4043 


17944 


A 


4067 


26 


365 


WVSLMTSALAMRVDFYFI TLL I LGLLTS 
TLTIYQ*WRDVTRESTYQGHHTPPVQKG 
LRYGriLFITSNDCFFAGFF*AFYHSSL 
APTPQLGGHWPPTGITPLNPLEVPDLNT 
S 


4044 


17945 


A 


4068 


491 


174 


TQLKTH * GS TTDFRTE VRVE PRVRTNYK 
DLLKFL*SKGYDFESETETETIAKLVKY 
MYDNRESQDTSFTTLVERVIQQLVLSHI 
FKDNYANINAEKLSGN 


4045 


17946 


A 


4069 


210 


402 


NVS KGLRKVP S T.Q * VINI S CRPG WARA 
CGPSTLGG*GG* I TRS G VRD Q P GQHGE T 
PSLGPGAV 


4046 


17947 


A 


4070 


202 


I 


DSAIALQPRQ*EQNSVSKKKIYRTIDLR 
SEYFARWSFTLAARAGVQWRNLGSLQPP 
PPGFKRFSHAS 


4047 


17948 


A 


4071 


412 


143 


FLGAGV* PRLGPKG*TPFLLKNQKLIGH 
GGGAL YS QYFGGWGRRNS FNPGGKGFTN 
QNFPPSLQTWGKKGVSFSKKKKKERKTR 
KWGGE 


4048 


17949 


A 


4072 


2 


324 


RRGNFCMF*SDRVSSCCPSWS*SPGLKR 
SSCLSLPKCWDYRFEPLYLARLVLMRCY 
STHNTYIMYCQMQRLFQGHRYLSVSTSS 
ATRPYLDNFFFFCKNKVMPCCQG 


4049 


17950 


A 


4073 


2 


326 


RRGRML P RT P GRPGNDAQ Q P QKH SLVD * 
LFIYLFYFLWTGSHSVAQAGGQWRNHGS 
LQP* PPSLKGSSRQSAGRVKGVSHCAWL 
LLDFLS FGKP YT YKKVDKWS FEA 


4050 


17951 


A 


4074 


422 


116 


EIKQEKNPGPFFFFPPPAKRGFFPTPLI 
WGPPGFPPSPFLKRRQGGFFLGPL*KGN 
PPVFPPPKVFFFWAGPPFFFSFKKKFFF 
LiVPQSTFIHFIFHICYLL 


4051 


17952 


A 


4075 


3 


294 


VF CHVGQAGL E LLDS SNLPSVASQSAGI 
TDTSHLT*PDLSFCCKTKFD 


4052 


17953 


A 


4076 


60 


389 


PSAQI»GSTKHIHMAVQP*PPPSPELLHL 
PKLMLCPHSAHTPRPPPNPGTVCTYSPR 
SSPAFPQEGPSSGEGLGGLVSLiFSSPPN | 
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HVHPAPTSPESGRGRGGRVPRSGAYV 


4053 


17954 


A 


4077 


93 


385 


KPLFAPGVLFGGPGEPKREKKGGAQGGP 
KKFSPPPPPFSFPPPKTLWVAPLFQETQ 
GliRL C P P F F F * RGKGAL F * KRGGGR F KP 
PPPPPGLLGPPTP 


4054 


17955 


A 


4078 


2 


365 


RKVEGRVS * DEDLiKLTELLRYYMLNI E A 
AKDLLYRRTKALIDYDNSNKALDKARFK 
SKDLKSDCAHPRDC*RA*APYFLFAKNE 
LWPTGEQRHFLQDVPLIVQRTLTLRTP 
DHTSLPLSL 


4055 


17956 


A 


4079 


6 


298 


PLAIMGGFFTLAETNRTPFDLAEGESEL 
VSGFNIEYAGGPFATjFFIAEYTNII I IN 
TLTTT I FLGTT YDAPPRP * LSPSLFYYE 
PPSPYPTPWSTST 


4056 


17957 


A 


4080 


414 


154 


PI PATREGKAGNSIiNPEGEGCMKRRSRH 
CTPAWQEGKTPSQKNKKVLLLALKV* S I 
FHLME S * KYLKPGFSLDL I TVI PTYFDL 
FRC 


4057 


17958 


A 


4081 


347 


1 


VLKPRPGNIIFSPKKKKNLPPPPGRNYF 
FFFPPPPFLFFFFFFFFFFFFFFFFFFF 
FFFFFFFF*RGGLIiLFRLKIQL 1 PPTPHP 
YLSNTLRGWNPGEGSCWRSEGGRGEDCL 
QDH 


4058 


17959 


A 


4082 


1 


354 


STFIISLFPTTIFMCLDQEVIISN*H*A 
TTQTTQLSLS FKLDYFS 1 1 KKKKKKKKK 
KKKKKKKKRGGGGIiKKKKFNGGGGGKKF 
FFWGGKKKNWGGGVKKPGGGKKRGGKKK 
KGGGEK 


4059 


17960 


A 


4083 


384 


85 


FFFPKEFFGCPRFFFSPPGFLTPPPLQV 
LEFPIKKKLSPPPPPLKNFFFYTPPPFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFLLM*AKKFISWR 


4060 


17961 


A 


4084 


1 


122 


FHRFGRDGLDLLTS * SARLGLPKCWDYR 
REPPRPAFLPQP 


4061 


17962 


A 


4085 


223 


401 


NGKLVS VFFGFCFCFFEM* SRCVAQARV 
QWRIFSSLHFLPPGFERISCLSLLSRWD 
YRC 


4062 


17963 


A 


4086 


160 


408 


ALPPRRGGGGAPSFNFCLLPPGP*GKKG 
GYGDAP S CRSGLGQHGETAAPLKAQKS F 
GHGGRG YQ VRRRVG P P G PGGRRQ KLNEG 
APPPPRLGGKAAP 


4063 


17964 


A 


4087 


261 


81 


YDIDGRIFCQKLESSHYSPGAVAHACNP 
SPLGG*GGQITRSGVQAIHPSWPPEVWG 
LQA 


4064 


17965 


A 


4088 


428 


140 


QPLKNPPLGGRAPPFLRGKKLPPPPPQG 
EPPPF*KKKKKKRGGGGPPFFPPPPRGE 
P KKS L * PQKGGGKKKKKS P P P P PQ KKKK 
NPPPKKKKKKKK 


4065 


17966 


A 


4089 


1 


205 


FRLRQENCLNPGGGGCSELRSHYCTPAW 
VTEQETLSQKKNFFN"* KTFRKAKSTGNF 
HMPNTSIQQLPI 


4066 


17967 


A 


4090 


154 


410 


ITGCSII KYFYRAMREREKS EVGYFPSS 
SINTPLRHRLSDGVLDSRITSLSTHSPA 
P FI KS KNRPGTVAHACNPSTLGG * GGRL 
TK 


4067 


17968 


A 


4091 


428 


67 


LPHPPPPPPPLSSSSSSSSSSSSSSFPP 
PPPLLPGQGRTLTRPLPAPAGQLGSRES 
CGAP P S S LGAS LHVA* NADGRQ * RAAL S 
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TPHGPLTGSPAAWRGRREQDLLLLLCS 


4068 


17969 


A 


4092 


431 


3 


APTSNKGSTIPILQIRKLRATPCSSIGP 
S PSTLPVKLEL* SVLLPGQWLNFSFCHF 
FPRPKCSNRPRYTLPVPKLPSSGLPPRP 
SPSQQVTAFPPLTSRKEPSSPSTPASDA 
FPFPPAPVSARRQEVLCES IPSVTMHE 


4069 


17970 


A 


4093 


151 


1 


S IMPMWMDAVAHACNPCTLGG* GGWI I K 
SGVRDPPDQHGETSSPPKTQKL 


4070 


17971 


A 


4094 


35 


363 


LQSKIYCTFFFFLERGFFFSPRVGRPGA 
HPGLGAKTTGVKGNP P P * P P KGGDPGGV 
P PP PGC FFWFFLGKGGFFLL PGGG* NFG 
KKKKPLVWPPKGGELKGGTPGPPPY 


4071 


17972 


A 


4095 


376 


1 


LGKSPPFLGAPFFLKKGSLLGPLiLGRGF 
TPP * LGGPHPLEKGPLLGQWNPNQKVWK 
KRGFPQKGGENPPFFFFFFFFRDGASLC 
CPGWPQTPGLKQS SNLStPS SWDYRCLS 
KSLFKENTETLAS 


4072 


17973 


A 


4096 


65 


446 


PFFLFFLNKGPLFTPKRQGKPRNFPT*T 
PCPRGKKNPPPPPP*GGEKRGGPPPPGK 
FFFFFGERGFPGGGVLFLDLGTPPLGPL 
KGGEKRGRPPGPGPKKPFL*KRKKMSPP 
TGGERGRGGKKHCLP 


4073 


17974 


A 


4097 


34 


387 


I KE AD E Y VLTRLiAC VF VGDME P L S S PHR 
LHTRIGSSPSTTCCTPACPCSSW*HGAS 
VI PS QTA YEDWF I TLYNVLYTS L PVLLM 
GLLDQVGASHKQGHFWTDENALEKSQAN 
VSMHSS GVYFFL I KPYYNSWRHQGSWL 
CFLFCNKERGF 


4074 


17975 


A 


4098 


80 


422 


I IjHVL Y T C 1 1 L C F VLQ S * I L WF VLQN Y I 
LHVLYTCIILCLHVLYTCIIYMCYrHV* 
FCILALFAGLiS * DS P YHESL* * S FFMEA 
DISLCRCVITDQLNFICSLLQH*ELRMF 
QF 


4075 


17976 


A 


4099 


74 


402 


I YLLS THLH YQ FSWISI TFD I S FFFFFF 
LKKESLFFPQRGGGGGNLGLLKPPPPGE 
SPFPPPIFRGGGKKGPPPPPWVIFFFF* 
KKGGSPIYPGGG*IFGPGDPPPPPP 


4076 


17977 


A 


4100 


3 


335 


DAWASAWVTE YDS VS KKKKKKRGGGGKK 
KQFPKKIGPRGSLFKPRGGEKPLSLKTP 
PPPFF*F*KPPFKKGGAEPWEPFLiKPPF 
CliKDRGGAPKIYFFPNPPGAPRGAAFIK 
RGGGK* *GPPGGLLL 


4077 


17978 


A 


4101 


406 


1 


RFPKPFPPKPKMSKWALFK*KGFFPPRE 
LLIGPPPPPPKKKKFFPTLFFAKVLPPR 
FF*NFNQVSKKRPTFPLKVGQ*PPPMLV 
Q KKP TRG F PALFKKKKRS S LV I RE I KRK 
P TMRYHLTP I RMAVFHKS KNNRC 


4078 


17979 


A 


4102 


2 


183 


VNPGGGACSEPR* CLCTPAWVTERDSVS 
KKRADNDKQGALRSFFLGFPFFLPLHPT 
SFAF 


4079 


17980 


A 


4103 


119 


391 


DNLQCFFFFFWKRGLLFLPRGGGFKSTG 
PFSFGVPGTPPPPPPGEGGLTPKPPPPG 
* ILFFLEKGDFPLLARW*NRQKKKNPP 
PSPPKGG 


4080. 


17981 


A 


4104 


413 


64 


GFKPPPPKDPPPPPPQKGGFMGWRPPPR 
P F APL FS P P P PRGGGG I F P * GGGGGAKK 
FHIWGPFG*KKPPPGGG*NSGGQKNPPG 
GGGKNPPQKKKKKKQPTTANIWFLCLE I 
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LSGM 


4081 


17982 


A 


4105 


2 


196 


RGRVGKVLI SLFNTDLLWLLC I LICFHK 
TQ * LCNG* CHSRMKC VCVCVCARVC VCV 
CVCERERD 


4082 


17983 


A 


4106 


1 


335 


FLVET* FHPVGIiAGLELLTLGDPPR* ST 
LGDPKCWDYRCEPLDPASVSFFL.CLKLF 
IFVLLKLFFFFKPPLYSKLTRGRDNNYN 
YFPPPNPGPLPHWRADKSIIGGNRRLL 


4083 


17984 


A 


4107 


1 


148 


GERG CSELRSCHC TATWAAE * D S I YKKK 
KKKEPPKFFLAGFSGGGKKTP 


4084 


17985 


A 


4108 


331 


41 


GLNFLAQKFSGRFSPPPPGRVPLGPKPF 
FFLGKPPILEGFPGP*KVFFSPILPPKE 
VLTFFSPPGEKIGPPFSGGACSGFPPKG 
AKKKKKKRPWSN 


4085 


17986 


A 


4109 


137 


3 


TKKERN1 NRG PGAVAHTCNP S TLGG * S G 
RITRSGDRDHPGQHGE 


4086 


17987 


A 


4110 


3 


269 


GFHRVSQDGLYLLTS *SACLRLPKCWDY 
RHEPPHPGFLGFFCFFFFFRDSVQPGQQ 
S ENP SH KKKKD E G P F WKGAGK I FL FKPN 
SSAL 


4087 


17988 


A 


4111 


196 


330 


■ GAIARGRSNCDC J *GSVGHVL*NDPTLN 
CILDQDTLYGS IMINEYYERASDAWALL 
PDEEQLPEGDQTVI A* * A*DMSGGQQHR 
VSLARAVYSGADVYLLDDP 


4088 


17989 


A 


4112 


426 


241 


LLKRVRHEILFTIRGKGFSEPRSPPCTP 
AWATERISVSHKEKKRKVLVRP*VDLSN 
SShSVh 


4089 


17990 


A 


4113 


394 


1 


GPG VF S P S GC L YHARAQ Y F W PHKKKKY F 
PHPGQKKTSVFLKGRPIjFGFGFLIjIFFFF 
LVEMGSCCIAQAGLELPTSNNPLTSASQ 
RAE I KDVSHRSQ * FFVFL FETGS CSVTQ 
A*VQWCNHSSPRTPGLKQF 


4090 


17991 


A 


4114 


328 


2 


KKRAHPIWPDTGIEVPNSPSPNIRFLGA 
PESFFFKRNLPQGDFLEKPNPCYFFFQG 
VTTPPFPLFPPPPKIFFFFF*DGVSLCR 
PGWS AVARS QLTATTHLPGS S DS P 


4091 


17992 


A 


4115 


220 


3 


FKKGEPGDPPPVWFGLGWVWVWVGFFFF 
ERH S F TL VT QVGVQ WCNF S S LQ P P PRGL 
KSFSHLSLPSS*NYRH 


4092 


17993 


A 


4116 


342 


2 


PINFLYLSFNPNQV*RSPNFSKILHFCF 
KILSFFYTLAFLPFSSQLIRQSFFRVYL 
INTICFLQDLYYLYILNLYLILFLFFHL 
ASRLLDFCLFFVFETVLLCHPGWSTVAQ 
S 


4093 


17994 


A 


4117 


124 


352 


TSGFT*TRMKSKIPT*RSFI IDLLEAKM 
KEKFLKAARETQLITYRGTSIQMPVDFS 
WRKTEAKAQWNLMFE VLKE KNCETRSLH 
PATMSFRNEGKTKMFSDERKLRDSVTLP 
LKD 


4094 


17995 


A 


4118 


11 


387 


KTGKLLATS APGDG I MVMVETF CL WS F 
FLSVSKKKKKNPLPQIKNPLPKPCWFK 
RAPTPNH * KAFPGP P KTPP * KQPKKQ I P 
CRPPKKGPFCI*PNPGINPSPLFFEKKN 
KGKAPPFKNSKQK 


4095 


17996 


A 


4119 


2 


407 


NTFQDQSGSSSNREPLLRCRDARRDLEIi 
AIGGVLRAEQQIKDNLREVKAQ IHS CI S 
RHLECLRSREVWLYEQVDLIYQLKEETL 
QQQAQQLYSLLGQFN*LTHQLECTQNKD 
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LANQVSVCLERLGSLTLKPEDST 


4096 


17997 


A 


4120 


157 


2 


LTGSHSVT*AGVQWCKHGSLLPRPLGSK 
RSSSLNFLYSWDHRCGPPHSA2STF 


4097 


17998 


A 


4121 


3 


168 


IALKSGRKVDIRELGHWCPGWS*TPEL 
NQSTCLGLPRCWDYRREPPWLALNMF 


4098 


17999 


A 


4122 


2 


376 


ETGFHCVSQDGLCLLTS * STRLGLPKCW 
D YKMRE P PRP ADGF IN I RDFNPLLRTS T 
CCCHVKKD I FAS P FTMIVS FLRPPQP CR 
TD P I QNTL P L V IMS P VAL LG CD S FS AF P 
VLMNLPFSSDCTK 


4099 


18000 


A 


4123 


11 


221 


GTQLLLRLRGEDLLNLGRGGCTEPRLHH 
CAPFWETEPDPV* EKKQAAKAKNSLLTA 
ALIiKTQVKKDSVAW 


4100 


18001 


A 


4124 


2 


125 


AI IKKTKSNRWW*GCR*RGMLVHCFWKC 
KL VQ P Ii * KAAWRL 


4101 


18002 


A 


4125 


425 


152 


TPVI LALWE AKAGRSLE PRS * RSAWATW 
QNPISTKKYKK*GGRITSAWGG*GCNGLi 
*WCRSTPASATETPSQRKKKK*EKNLFT 
LILKVIK 


4102 


18003 


A 


4126 


3 


99 


CQAGLELLTL*SDSLRLPKCWDYRCEPP 
RPA 


4103 


18004 


A 


4127 


396 


58 


ENCNPLSWSGGI ISGLLLVSGYSNKFPS 
FLVCLLPVIGPPENTAGPSLVPGTKWHD 
HSTL*TSQTPGPSLVPGTKWHDHSAL*T 
SQTPGPKLSSRLSIiPSSWDYRHKTPYPA 
H 


4104 


18005 


A 


4128 


51 


549 


LGQSYLLLLRKCFSFNFQVGDLDISYIN 
IEGITATTSPESRGGTLWPQSSKHTLPT 
ETSPSVYPLSENVEGTAPP*AHQSFMSP 
PSWGGSPLNLFFGGGGFEKEQSPLKKKS 
FTLYPLGPPSEGEGHF*PSPLCFPPGFQ 
SPPKTGIPSGDELGFF*TTPGPDKKK 


4105 


18006 


A 


4129 


97 


362 


RK*SACLSLPKRWDYRHEPPRPACSC 


4106 


18007 


A 


4130 


376 


279 


DQDDLDL1L1TS * S THLG I S KC WD YRHE P P 
HRAS 


4107 


18008 


A 


4131 


133 


344 


1 1 FF F F * NF VLQ AGGQGGNLGS LKP P P P 
GLKQFLGITFGRSWDHGPTSFTRANFCI 
FRKKRVIiLCCPGWS 


4108 


18009 


A 


4132 


164 


3 


MNEMSYESSLLDHLLKRQ*NQCCLGSSE 
PSPPRLKQFSHLSLPSSWNYRKTSK 


4109 


18010 


A 


4133 


355 


2 


GKKQNRKTGNFKTHS AS P P P P P KERS S S 
RATEQSWMENDFDEMREEGFRRSNYPEL 
RED I QTKGKE VANFE KNLEEC I TRI PNT 
EKCLKELMELKTKARELREECRSLRS* C 
DQLEE 


4110 


18011 


A 


4134 


193 


357 


HHCNP S LCQNDLFWHLVL S PPTGVQWYN 
HGSLQP*PPGLE*SSCLSLPGS*EYRH 


4111 


18012 


A 


4135 


357 


104 


LRRL*YENHLNPEGGGCSEPNLLRCTPA 
WATEQDS ISKIIIII* CNFLH* QNMQKI 
KQNPKTNSSKILTEIKISLKGNLRELRE 


4112 


18013 


A 


4136 


179 


381 


DTSLH 1 1 QNRLS I S KKKL * TL YF I Y*DR 
VPLCCPGWSAWQS * FTAALTSIiKRSS C 
LSLLSNWDCRC 


4113 


18014 


A 


4137 


89 


318 


CLE I SCTKIGQDLYTENYKTLLGKVCKC 
PNK*NTTFMDQAE VS ILKI S FLPRLI YR 
LRU PVR I PAG I FVE I LKFG 


4114 


18015 


A 


4138 


315 


60 


KKLPAEIHAPLAEEATD*ABLKPAGETH 
AEVQAPTEQTPAAEAATTIAEASVKVQP 
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P P AE E AS LAKF P AE I QP PAD Q E RGS E D I 


4115 


18016 


A 


4139 


377 


51 


GFTSPFPED*RIPGLFSCIGWTLQSFHF 
RNIGSVPFSPPGFFFFPQEMRVLLCCPG 
WSQTLGLKPSSCLSLSNSWVYRCAPLVP 
GMP I F * LS CFLAIGFLEF VI Y I GY 


4116 


18017 


A 


4140 


413 


175 


NFGIFCRDGVLLCCPGWS*TPRLKPSSC 
LPKCWDYRREPRTVPGLPFLFLNLRKDG 
NSVLE I YLEYKI TQ I I VQRLLVL 


4117 


18018 


A 


4141 


1 


394 


E I FVT KRWSGLG WAHASNP S TLGG * GG 
QITRSGV*EQPDQHGSPERRGLKERRGL 
MVMKATFLGLRHHLEHI VGHRE LRDLHN 
NSKKKISSHPLSTPHVPGTVLKPCRLIL 
CFKHHQKPVRKPPNSPSYR 


4118 


18019 


A 


4142 


22 


376 


LVNKGKTIFFFLKRGWLPPKGKGRGIN 
TVNGS LNFRGQGNPLA* P PKKPGTKGGG 
HNPGEI * *FFGKKEA*QCGPGGSGIPGP 
RRP SGIiTLQKGGT YGREP S PLPGPTKEN 
PRLTKT 


4119 


18020 


A 


4143 


161 


1 


PIPAKYEHFCFFVSLF*DRVSLCHPG*S 
AVV* SQLTTASTFQGSSDPLTSATP 


4120 


18021 


A 


4144 


18 


159 


KHLDPGGGGCRE PRS CHCTPAWVTTAKL 
HLKK*K*VELKSPSVIHT 


4121 


18022 


A 


4145 


383 


109 


LFFYKIQNYTGHGGPCLLTPFLQRVKQK 
NCFNPGGGGCR* PKLVFCPPTWGKKQGF 
VFQKRLKKPKPKPKPKPNWRGLFLPPFS 
KRPKPKI 


4122 


18023 


A 


4146 


374 


164 


QCQLLRSLRWEDCLSV\TGG*GCSEP*SCP 
CIPAWVTARSCLQINKSLPAEIVIPSLY 
JtfQIARG 


4123 


18024 


A 


4147 


25 


375 


RKKKAL F F CQGGS Q P P PSNLMD P P P P GE 
KRISWLNLFPKKEKKKF*KKKKFF*KVK 
KNP FLKP KGRP P PGE KKKGKKGKKP T P Q 
KGKGFLFAPPPKKPIKKSPHFKIPKKQK 
NFWQR 


4124 


18025 


A 


4148 


376 


202 


HYNS KVFPGGPKRS FL FFLKALS FFFAP 
APIPFWLHSKIFFFFF*DRVSLCHPGWK 
LR 


4125 


18026 


A 


4149 


3 


144 


LFCPDWTPTPGLKQSSHLGLPQCWDYRH 
* VTVPGLP I SFVFFFFPP 


4126 


18027 


A 


4150 


3 


188 


QLQSQLLSSLRPEDHMNLGVEGCGKLSL 
HHCTIA*MTE*DSIS*KKILKKKCF*KK 
HSFLY 


4127 


18028 


A 


4151 


83 


358 


GWARWLTPGI PALWEAKAI KS LEDLNA* 
A* QT YAPS * EES LHP IMNTLIRDPVWT 
SFRVTVDTSTIARHWLFCWDLLQASIGH 
ILILGPG 


4128 


18029 


A 


4152 


162 


1 


TVWYWHKNKLSNQWNKI * FLELDP I TYK 
HLVYDKTGISNHWVKDALVNTQCRAG 


4129 


18030 


A 


4153 


369 


20 


FQLLARQRQENGVNPGRGAFG* PKSRPC 
PPAWGTK*DSVSKKKRKKKSCEFVHRIL 
KLiEE AYHS LGSNTAQRL F VS C FHKK I L C 
LSISIPDPYLQFKMWPVPPSKPLYQIDW 
VGLL 


4130 


18031 


A 


4154 


117 


2 


QI FFFFFFFKMESHS IAQAGVQWRHLGS 
L* PLPPRFKR 


4131 


18032 


A 


4155 


369 


217 


FFSPPPFFKPPPPPKKIFPP*RKKNPPP 
PKKKKI FFFFFFFFFFFFFFQTP 


4132 


18033 


A 


4156 


214 


283 


ASPSEGQGLSSGP*GLIN*SDKPSVTHS 
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PKPHCETA*GGTSFLSSSGSSPQAPKSP 
HHGPDSPLSDSIPMVISPPEPPPLRAKG 
CPQGPPGASWWGTT 


4133 


18034 


A 


4157 


319 


712 


QDF VS KQHS YCL KNL KS S S L I Q VAMNRA 
QVCL ISSS KSGERHLYL I KVSRDKI SDS 
NDQESANCDAKGNNFICIMLYKIWLN*K 
RLMK*HWTAI*LKIYFSNKMKFSNLNLS 
SVINSFDVKKERLNANTLN 


4134 


18035 


A 


4158 


183 


369 


LKRKQSQRGEVNFLKVTQPECGKAGILF 
IYYFFEMESHSVGQTGVQWCNLG*LQPL, 
PPESSD 


4135 


18036 


A 


4159 


190 


336 


DPISIKNFWPGTVYFLIETGFLHVG*AG 
LELPTSGDLSTLASQSAGITG 


4136 


18037 


A . 


4160 


10 


390 


QGILLPCFSMSSEERGRIYSNTFSFFFF 
FWKGS P INPQGGS QAKKPR* REPSPSGL 
TPQRPGNGGPPPPPGQNFFFKKKGG*PG 
G PGRA * T PGPRGT P P LGP P KAGNKRGGP 
RGRAKNFLKNQNGPT 


4137 


18038 


A 


4161 


53 


376 


YliFAFFFFSKREAWQGGQPGMEGAKKKF 
MAP * P S KGGGGE FS GLTP PNQWNPRAQ P 
PPRGELILILEKKRPAPNAQPGPENLGP 
REPSPLAPQKGGNKSENPWPPPP 


4138 


18039 


A 


4162 


1 


356 


GVF I L VS FNFWAF FLS FF FLGKQ I LVLM 
PQPESQGQNLAPQNPPPPG*RGFPALTP 
PRTGNKGMAPQAPQFFGFLKKGGFSHGG 
QGG FKT P TLGNL P P IAPQGLGNNGLG P W 
PPPFFF 


4139 


18040 


A 


4163 


345 


26 


APGFKPALGPPGGPFFSKKKSKNLPGLR 
GSTLQAVGPTFLGG * GKRSLKP PGG* AS 
RGPGWGPSPPPGGPN* IPVSEKKEREGK 
KKKGREKKKETFSSNLLIFLSF 


4140 


18041 


A 


4164 


18 


284 


TIiQGTSGIFEGNFFFFLGTGISIiYCPGW 
RAWGGDHGSLiQL* PPGSSSPASTPLGRS 
WDYRHVPPGWASFFFFKTLPNGVTQAGL 
ELWG 


4141 


18042 


A 


4165 


377 


30 


FSGKKKFFQLGWGCGPPPLIPPLWGV* P 
GQPPRVGGFNPPLPPWKNPVFFKKPKQP 
GVGGASPYSPFLGKFRPRIPFTLGPEGS 
G * PNF P P F P P P WAP KKNF S P KKKKKF YN 
CLW 


4142 


18043 


A 


4166 


305 


1 


ESRQVLTSRLTLLS IHEFNYLLNNSTYK 
HITSLPINIRQYGQDLKTLKKETKDLNK 
*GNIPCLGIGRINIVKMSLLSKIiIYKFK 
AMPVKI PGELFLRNQQA 


4143 


.18044 


A 


4167 


394 


78 


EGKKQGRKKGRREGS * EGRREREKTR* K 
AE KE GMKEGGREG KKE GRKE GRRKGR KB 
GRKEEGQKGGRTDGGWWVLRSLDTSQLI 
S FASGP KWREAS S LVWASLCL 


4144 


18045 


A 


4168 


1 


431 


CNTCVR* CAYWCV*MCGERLCWCVCSCN 
RVCCV*VERVCVLVCMSVCWCVCCCARQ 
CVSVCHSRIESSRPSSGPVAPSKCQTHL 
YLPKLSAPPAAGLWDLPSEATASLSGSIi 
CQPS CPRTRS FATLVS SMYHRPVTGELQ 
SIT 


4145 


18046 


A 


4169 


3 


423 


YNARE I EAAAGRDHATALQPG YRVRPCL 
KKKVLKLGQAWWLA* F*NFFFETGSHSV 
PRLECSGVISACCSLNLPGIV 


4146 


18047 


A 


4170 


118 


402 


QICTCTPKPKMKVKKITMLSNNMRYLiYI 
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HLTQD J * DLD I DL * NTADMNLKDLNK* R 
E IHCLRI *RLNI I KTS I FLKLPYRVHAI 
PFKI SARSFVE 


4147 


1804S 


A 


4171 


404 


285 


RGCGE P W *HHCTPAWVTE * DL VSKNKMK 
QNDNLSHLSLA 


4148 


18049 


A 


4172 


274 


431 


ESLVGQRNWVAKYFYRI TAPGRAQRLMP 
AI P AP WE AE VGGS QGLE I ETLL * NNS S R 
PGAE AHACNP S TLGGRGGR I TRSRD * DP 
PGQH 


4149 


18050 


A 


4173 


24 


366 


FLCLFHTCVLAFPVQQLCSVGWGCKVEN 
ADREEKQPCDSQAREEPRLCRRGFNRAP 
NINTRHEKLFFVCLiF*TGSHSVTQAGVQ 
GCGHSSPQPQLPGPKRSSHLSLRSSWDT 
GM 


4150 


18051 


A 


4174 


397 


1 


TEPVLANETWGPWDRGPLGLGGLGPENT 
DNSHAVGADQLLKEPPFPLSTRSIMEPM 
SLNAWLDSPHREMQAGTPLSLCGDTYET 
QVM*SWGIRGILPQQPWKMGASLSLTAS 
LSLSFAHTIiKHKHTHTHTLY ' 


4151 


18052 


A 


4175 


90 


387 


KGRWL F F P QGGE RGHNG * TAP P QTGGKE 
NPPPPPPKDRGKKANAPPPGGIWYLKKN 
GFFPIGQGGPEPPPPRGPPPPAPPKGGE 
KRGGPPHRPGKKLLK 


4152 


18053 


A 


4176 


398 


280 


RRLKWEDPLNPRV* GCS KP * SHRCTPAW 
VTE * DP VSNFF 


4153 


18054 


A 


4177 


253 


395 


LFNFFFLVKMGSRYVAQAGLKPIA*GNP 
PASAS PRAGITGGSHHTQP 


4154 


18055 


A 


4178 


366 


48 


P WAS PG I S LS FS FLRTKP TVKVR* YRVG 
PQRQPCPSRWAPPACPFSMFPPAWVSPP 
GS AAS RP PRHAGSQ VGATAS P P PAQGLG 
APAWTAGLGEKQKLAARMRGT 


4155 


18056 


A 


4179 


44 


413 


GDGVTMSAFFFFFFLEKKVWF I P PGGGPK 
P EFG FRAP PPPGVKRIFG FTP LRTGE*R 
PLPFPPGKGWFFKKNGGSPMWPGGV*PS 
D PKGAP P P GP P KGGE * RGE P P W VGWN FA 
LL.KERKPLFKE 


4156 


18057 


A 


4180 


396 


235 


HEVSLCCPDWS*TPGLKYSSCLGLPNCW 
DHR CRNHGPGKT QLE QGHRQ VAGDSR 


4157 


18058 


A 


4181 


85 


408 


YTCSLRITQDHFFFFFFFLGKKIFFFPP 
GWGAGGQILTSPKTPPGVKGIPWLNPPR 
GGGPGGPPPPPTLFLGF*KKRGFPGGGQ 
KPRPKGDPPPLPPKGGGLKGGPTP 


4158 


18059 


A 


4182 


1 


149 


NHLNL*GRGCSEPRLHHCTPAWVTE*DS 
VLKKKS Y I HLGQNP T VRKVLIi 


4159 


18060 


A 


4183 


310 


3 


ATAPSQFIRIW*RNHGHYIEVQSEAASA 
NLETAVNNPENLGKIMDEGG*MKKQIWK 
EMTSRIFIVEKSMPGFKTSKAQALILGA 
NI AVNFRLKPML I YHLEN 


4160 


18061 


A 


4184 


1 


359 


P TRP L VLDRDRP P F F F FLGS GKKKNPGV 
FLYFGGGWTPPKKKKNSLGWGGGGLKKG 
GGGVFFKPLPFLGGGPFLKNSGGVDKKK 
RAPPFFTKTPL*KKPPPPFPPKIFKARG 
FW*KMGEPFFFCPPPPSFLKKGPPPKKE 
GV*KKPPPPPFLTPPPPNQGNFFFFWGG 
STPPQNIKKPLGFFSSQTPKKKKRAADR 
DLELADAW 


4161 


18062 


A 


4185 


1 


362 


HFTKYLFISC*K* *LLCWMQDCCKKGIP 
MNTNVI QE KATS L YGSLKHKEGEGPKAG 
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EFNAS KGWLDNFRKS FGFKNVKI TGEAA 
S ANQKAADKFPDAI KKI I EEKGYLPEQ I 
FNAGESAL 


4162 


18063 


A 


4186 


366 


47 


AALFLVDPPGRGGVKGPPPPPGPFFVFF 
QRRGVP P LS PGGF F FR P PNS P P PGP P KG 
GGSGGGPRGPA*THGVFLKKNPPNCGAG 
KKKG F F L KKKKKKP VFWF S VK 


4163 


J 8064 


A 


4187 


394 


112 


KNTENLKISWAWWHVSMVSAPNEAETGG 
S TEGTGCS KL * S CHCTAWVT VRETLS PR 
KFFKKRANYLTFRI *KVCSLYHRKTKNS 
FKKHTQGRKF 


4164 


18065 


A 


4188 


230 


372 


KPMFLP FNPAILLLGI YLKE YKS F*HKD 
TCMRMFITALFTITKTWNQ 


4165 


18066 


A 


4189 


149 


2 


NRSLRFMRFKEGSHLPP I * VQGKAASAD 
VEAAASYPEGLGKTIEsTKGGST 


4166 


18067 


A 


4190 


2 


159 


MHQPWLLRRPKQENCLNPGGGGCSEPRS 
HHC I PA* AIERDS LS KKKKKGGAF 


4167 


18068 


A 


4191 


180 


415 


LGFLQIiLLFVGRSMFIFALELSHGWLGS 
SLSF* P VCVKQSSHhSLS S S WDHKHMS P 
YPANFCI FCRDKTLPCCLGWAQ 


4168 


18069 


A 


4192 


424 


41 


PPLLRAPPKGLL* PKNSFPPGAPP* IPF 
PPKKIKIFKFTPGGGGAPLFLLPRQVKA 
EGSLPPRVFGPPPFFFPPLPSPLGAKPN 
PFFFFKKKKKKEWLHKSSLFYSAHFVLC 
LKYSLSFFYQSVKTK 


4169 


18070 


A 


4193 


105 


424 


ELKRLVIKLIRGIPEKGKAQCKEIQKLA 
QEVKGE I FME I GSIjKKKQ * KI QETLDTL 
LKMQNALESLSNRIEQVEERNSELEDKV 
F KLTQ S NKDP S Q I KKKI L YNV 


4170 


18071 


A 


4194 


3 


240 


LCLQSQLLGRLRQENPLNIiRI*GCNEP* 
LHHCTPAQVTERDPVSKENKGSFIPMKI 
GKLVSLTLNRMCNCKNSEKENV 


4171 


18072 


A 


4195 


385 


1 


KMIILTKKMEIKHKDEKELQKTEVDLKI 
ENNTTRARLTNNIKKKPPESEKTNTKNH 
T KKS L * D QDT KKE TRPQKKKNP FKS R * V 
RHWRNI LWLES RNRGKKNP S GVQKRRDS 
VPTLITQTQETWWWCF 


4172 


18073 


A 


4196 


3 


272 


Li F YL I MAL KH KS S D VGNSN I YAKE KLE S 
ASFKER I * KS YVEVAKI YGKNEYCI YEI 
VKKEK* IMHSIYRVQYYLQFQTSLEVLE 
RIPHG 


4173 


18074 


A 


4197 


236 


379 


GGRFKGSNFTSPGLQGNSFFMGPPKLNS 
RAGV* QRREGKNPGVPQFNR 


4174 


18075 


A 


4198 


375 


1 


NFKIPAPPPQNKGPLRVGTPPFLIFSPP 
GS PPKS FPKAKWARRDRPF *NPGETLQE 
G* VFFS GRF FFFFFFRDRVLLCHPGWSA 
VAQSRLTTSSDSRVHTILLVGRGCSEPR 
S HHCNP AWVTRVK 


4175 


18076 


A 


4199 


1 


270 


P S RTGPQ I PRR P TRSNCPNQFLGF * GCG 
EPRSRLCTPAWATEQDSISKKERHGLLV 
FLKGF I HYIRLL INLFAARGRKRVLFAL 
GGREMW 


4176 


18077 


A 


4200 


388 


1 


PPGIINLFDPGL*PPPPWVMGPPPS*KL 
ISPQKKKNKVPPPVPPINIFSGPPPPTL 
VFWVLFFVSLKRQLIL* *KSGVSAASLV 
RHDGAPRLTS S AGKGHAE PTFQ PRPTHH 
RNACSFVIilHIQSYKVF 


4177 


18078 


A 


4201 


298 


158 


FKI VINFQFLTP I FAPSEGGRGG * GRGE 
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LRSCHCTPAWVTE * EPI * KRKI KGKKRT 
EQESRRQNLA 


4178 


18079 


A 


4202 


128 


358 


KLMIPPSPGQPPRPGGRWGGHPPARPAA 
PSGR* GAPLPGRPY* EKRI PSARTPPRL 
GGWPSMSLRTGHDDNGGVWE 


4179 


18080 


A 


4203 


406 


1 


FFFFFLRQSFTLVAQAGVQ*RNIjGSQQP 
IiLPGF 


4180 


18081 


A 


4204 


322 


2 


KPRPRKTPGGPFPI *GGGGAPPPRGGAP 
KGGSPPPPPFPPPGGKKGGKKKNNPPGG 
* KPWGGFFFFP P P F * TPPKKRGLFLKKK 
KTQN P KKKKKGG R S R YRT S P R V 


4181 


18082 


A 


4205 


3 


127 


GFHRVSQNGLYLLTL*STRLHLPKC*DY 
RSEPLCLACFLFL 


4182 


18083 


A 


4206 


1 


216 


GFRRVGRDGLDLLTS *SACLGLPKCWDC 
GR*PPRPAHLGGNSNAKEPGLPACPLLS 
GHMHRAVWVRWATATL 


4183 


18084 


A 


4207 


427 


0 


IiFFTTNFFFWARVFFLPPFF*NPPPGFF 
FLPHKKKKNFPPPPGFFFFFFYAPPPFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
PWLKTWSC 


4184 


18085 


A 


4208 


2 


404 


PRVRLGIWCKERLFLCYFFFFLERGFCF 
LAQGGGPGGNFG*RAPPPPGEKKIPGPP 
PRGGGEKGPREPGRKIFGFLKKKGGGPF 
GPGGVG P P KKG P P P P KP P KGWG YGGKP P 
AQKRGG FF FG KKRAP FFNWVNG 


4185 


18086 


A 


4209 


40 


412 


PLFCLSEKPRYFEYAIYLMLSLKFLESC 
IMSVNS S AVSAS VYY* W* KIERE PLTS W 
TLE KLD P E VFHQKFAF T * RAKA I LRKKN 
KFEGLTLSGFKTHYKTTIITTWY*HKD 
TQIDQWNRIESS 


4186 


18087 


A 


4210 


219 


392 


HFFFLNYNFLGKGVS FCPQAEGQGRNLG 
*LHPPPPGLKQFFCLTLLRNWNHRLVPP 
PP , 


4187 


18088 


A 


4211 


409 


2 


LERKTVFFS PPFI FFFAPVFFLS PFFYT 
PPPLYIFCPPKKKKIFPPPPGKIFFFFK 
GPPPIFFFFFFFFFFFFFFWS*FFIMYQ 
IVFNISYNLIKVDSHLYLFKDKKILLCT 
ISSDAW 


4188 


18089 


A 


4212 


24 


399 


ADAFSTTNLHGLGPDLFTPTTQLYINLG 
MAI LL * AGAVN I G YR SKI KNALAH FLP Q 
GTPTPL I P ILVI I ET INLLILATALWH 
LTAI I TAGHLLMHL IGSASLAGSTI SLP 
STLTVFTIIiILLT 


4189 


18090 


A 


4213 


411 


197 


SWLTAPSFKRFSCLSLLiSS*DYRLPVPH 
PDNFC IGSRDGARM VS IS* HHD PPAS AS 
Q S VG F TAP KVLGLQ T 


4190 


18091 


A 


4214 


218 


126 


KEPEPLLFRSSWSVGAIDLKNVMRRVLI 
KSCTPGWAHTCNPSTLGGRGGQITRSG 
DRDHPG * HKS ALGS CAS CF I YL WDLLLH 
IVFE 


4191 


18092 


A 


4215 


1 


332 


MAPSLCLSNKKRGF I GPGFAGAP KHPGR 
GQGHPREKLAFFKKGPPKPGLKS FLVLK 
TPPPKPFFFLSPKGGALKTEEEP*TKKK 
GGAEKKPPPGEKRPLGPFCPKKPNWG 


4192 


18093 


A 


4216 


1 


185 


KLYLSILISLQISLIITFTATELIIFYI 
FFETTLIPTIiAIITR*GWQPANFLYDLV 
SFHEN 


4193 


18094 


A 


4217 


1 


388 


LRFCWETLFPLPYNPFSFHFSFLFFSFS 
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FFFFLGGKFCFCPQPGGGGPNLNLRDPP 
PRG * KE F * AP P P PRGGENQGS P P P PPNG 
GGF* KKGGLNHGGGGGFKPPPQKKPPPL 
ASQRGRNKGGAPPIiLKF 


4194 


18095 


A 


4218 


3 


195 


VIYSTIFAGTLITALSSH*FFT*VGLEI 
NMIoAF I PVLTKKI NPRSTEAAI KYFLTQ 
ATASHPA 


4195 


18096 


A 


4219 


261 


1 


E P RRRE RKE RKE KKKGATPS R I CM KRE R 
EERFS *ERRERGEEGERGEKEERERGER 
ERER S E E E RERERE R P KN* MRLNGRTRR 
TRG 


4196 


18097 


A 


4220 


5 


375 


DEMLHLKFTYILN*TLKDTIIPKWENL 
YN*DFLNSKVQGTPPPPRPSRPPSSPSP 
DPPPGPPPAGAGRRARRPGRRGSPPGGP 
PAPP PAP PPRAGRAAGGRGRPPARPGGR 
GPAARPAGGGG 


4197 


18098 


A 


4221 


258 


267 


GGGALKKKI YCGGGGGNFFF *SPPPSFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFL 
IYFIFILIF 


4198 


18099 


A 


4222 


127 


330 


KKKKKKKKKKKKKKKGGGP F KKKKF LPR 
GGGKN F F F * GGQ KKKLGGG VKKKGGGKK 
PGAKKKKKRGEK 


4199 


18100 


A 


4223 


1 


377 


RRFHLRRENLQEKQTWMGI FFFFGVAGP 
PGPGGKSFSPPSFFF*GKNTFFTPPPGG 
GGQNLVGENSNSNPPPPRDPPPSGPSEK 
I * KKTPPPFLVGPNTKNKKHFGGGGSPK 
KKTLLPPPPLDPF 


4200 


18101 


A 


4224 


223 


431 


IRKTGPLGFGGQISTNPLPKV^PPPGLTFF 
QKGPPPPPFPPFQKPRFLKFYPKREGPP 
PPKVGLTPPPGLL 


4201 


18102 


A 


4225 


173 


2 


RCKLHCLFPTLAI I TR * GNQ P ERLNAGT 
YFLFYTLVGSLPLLIAI IYTHNTLGPTR 
P 


4202 


18103 


A 


4226 


11 


377 


FLTWFICISLFGRLMGI*FFWFFCEL 
FFHAFFSLFLiKTDIIFSIDLLEIIIYAH 
VCHICCNL I FSLYF VFYFNGSLKYI KC* 
NLFAI I LD FADVS FL I FMFRTGKRQNDN 
QFSLKILIGF 


4203 


18104 


A 


4227 


1 


412 


KNSKVKNATDLLKNASESPHSRIDEAEE 
R I S ELEDRLFENTKSEETK* KR I KK* SM 
PTDLENSLKRANLRVIGLKEKVGKEL*V 
GS FFKDI I * QNVPNLEKDNIQAQEAYTT 
LS RFNPKTTS RHLAI KLS KVKDKER 


4204 


18105 


A 


4228 


373 


163 


IGVFLGGAPFFFFFFFFFFPQGPKRGGG 
KGGASGKTG*GGGGLKLGGGKQGWPPFF 
F LiRT W P T F TRAPGL 


4205 


18106 


A 


4229 


392 


3 


PPC*NRAPGFNFGGPFKKIYSSPPPRGK 
FGSLKGPPPFFFSAPPGGPKFSKGGGPL 
FFGAPDF * GGFGGKN YFWGQGF P * PHS P 
PP P FS WGPKRRAP FS KKKKKKRRRGWGE 
GEGQGSNHLLKHGYFQ I 


4206 


18107 


A 


4230 


211 


400 


YAIiIWLYCILWNYALHSVFFWCCFCFLI 
FKFI YVFLNLLFLFAF* LVLjCLLKVQNH 
SGSLS IVSYFPIjLEACRIFPLFPLF*NL 
*MICFNMVVXjYFKQLCSALSVFIiVLFIjF 
FNF* IYLCIFKFIVFVCFLRRNPTLAQA 
GVQWHDLGA 


4207 


18108 


A 


4231 I 69 


386 


KRIFFFLGPRGGGKGKMWVNGNPPPKGK 
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GNFSA*PPQRGGPKGPGPKNKVIFGFKK 
KRGFYGGGQPPNLGKELAGPPKGGEKRG 
KTHPGGKFYAFLGGGLFSKKRN 


4208 


18109 


A 


4232 


394 


287 


FKEKKKPPPPPLFSPPFF*KRGGGDPLF 
SPRGEKK 


4209 


18110 


A 


4233 


442 


112 


LLGRLRWEDGLS PQGRSCS E P * S CHCTP 
AWATQ * DP VSNTNNKNP MP QNAC S S LNP 
SLPWTAAVWRTLGEHCNGREALPARAKN 
ATKAP VLTTHHI CQTVRTI S CALLPH 


4210 


18111 


A 


4234 


415 


58 


slqkghfppfqeldpqnpgrsgwspvlp 
ttglffsldfirft*fmagvlvgykhcw 
ewtaalvlllflrdrvslcfpglihtlg 
lkrssclsllsswsyrhtiphpast*ff 

G 


4211 


18112 


A 


4235 


406 


168 


HGETPSFLKIQKLAGRGGTLL*SQLLGR 
WRQGNHLNWGGRS S VY I YTE KHTVHTQQ 
SVTTPFMSMSQTLAYSPLSLKSS 


4212 


18113 


A 


4236 


247 


397 


TVCFSVYIYTELLPPQVK*FPCLTLPSS 
WDYRSVPPLPAYFCIFSRDGVS 


4213 


18114 


A 


4237 


36 


405 


RLYCFIKGLNVKNKTFRIVFFFFLETNF 
PFGPQGGGEGANSGFPEPLALGVKKGPR 
PP P PGGGE * GAGP PGPGNFGLLKKKG VP 
RGGGGGPKPPTQRFPPGRPPQKKKNFFG 
PALGKKKVFFF 


4214 


18115 


A 


4238 


407 


3 


KKKKKLTR P GGGGP FS PL P KRVRQKKGG 
NP GGGF S KKKKS P P P P P P RGKKKKP F S K 
KKKRFIjSG1S[NTGKNRLGKDPQS* PAGNA 
RKKPGWERALARGRAPGMTDRKPQHSGL 
DGSQQSGTGQEPGNSGPVPKVH 


4215 


18116 


A 


4239 


144 


405 


PTVKLVLLYLVLLKVAVSKNLR* *GGLT 
V*QQRQILWKRILSNTEFLYLRQSKMMH 
LVL * PTVNTLKS LKSGQARWLTPVI PAL 
WEA 


4216 


18117 


A 


4240 


369 


40 


PLLQGTPAS WPRWGRAGDRPRKEG* LHP 
RETRVP PALHHQ PAPAGVSLTS PGP P FW 
LS LQRLRGHS S S DS LPAVCQ YSGS WRE E 
KAAAE AP ALT PA I CTCTI CSVLTWL 


4217 . 


18118 


A 


4241 


397 


1 


ISKSLFFPIPFWKKKKKGFSPPLNFLGG 
VFFFFFLKI FE I FFLKRFFFP I FLGAPG 
FKIFKKYFPFPRGFKTPPKKKIFLNPFF 
KKKGGGGGAL F F F F FFE TG S R S VT QAGV 
KWHDHGSLQP*PPRLK*SSDP 


4218 


18119 


A 


4242 


230 


393 


ADRPGS PLPS APETGSHS I AQTGVQ *HN 
YGS P * PLTPGLKLS S CLS LPNCWD YR 


4219 


18120 


A 


4243 


434 


104 


KQPTFQSTTGGSPSAGRPPPPRAKKKKP 
PFF*KKKKKSGNIGGPLFPPPPGGEEKK 
NL F PRKGEGS HKTD S S P PH * RGGKKKNL 
S P KKKKKKKAW C WD YRRKP LC P AT Y F 


4220 


18121 


A 


4244 


2 


425 


FVFKI ILKNLSE IQENLQFS KI KKTIHD 
LNKKFNKETGN I TKNQTE ILELKNSMSE 
I KNTVKSFNNRLNQAEERI SELEDRS FE 
MTQADKKRKKGEE S LQDVLYTMK* TN I L 
WSFQKEKRKEKGLQNILNE I VAEMFPSL 


4221 


18122 


A 


4245 


12 


424 


I YYKTLNCT * I F * FGGALAC FHFSYPFY 
IHYVGFVPSPVYLFSHVI ISVWTSGYLL 
YSLGCNP I LSLFI FLLKLFHFGYLT I QV 
GI YVWILFVCLFLSNSFINI * FTYHAI P 
IPIYIPLKVYNSMFYSIFTTK 
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4222 


18123 


A 


4246 


215 


425 


LSIQISQIRNLIYLFINIMGKILRAKRK 
*GTSDAVVNQTIFLRPRTVAHSCNPSTL 
GGRGGRITKSGDRE 


4223 


18124 


A 


4247 


413 


2 


WEVESFSLGKAFMWGCATENPGPPHPKF 
LLFIMKAPPPWRVGFQDCFNPKDGRFr* 
P RT P P C PRARGAKKNS VS KKKKI K I W KR 
WPPTPLPPPGNGGTRKFPSSPPPPPPPT 
TPKI TPPGGERQL * PQRAEGAGDTK 


4224 


18125 


A 


4248 


61 


281 


ITWKDFDSFSVLRVSKAFHLALPAFIGN 
C I KCH I KMG I YPEEKFKNIHKYL YLE I D 
VCGH*TWKKIFFLFFFLRQRFTLFAQAG 
GQWLDLSLLQPPPPGFKRFSGLiSFLSK* 
DFDSFSVLRVSKAFHLALPAFIGNCIKC 
HI KMGI YPEEKFKNIHKYLYIjE IDVCGH 
IRLGKKFFFCFFF 


4225 


18126 


A 


4249 


23 


13 


RVR P FAS PGGRE VT VCGGLGRE GQE RRL 
RWHRKPFLPLAATSCAWPRSVTPSVTS 
GSADGDGQSLAVGAGTLATVGGLELLNS 
ND P P AS AS QGAG I AGVS HRAW P * T 


4226 


18127 


A 


4250 


182 


342 


KGFFFLPPGGGGGGEF*FNEPPPPRVKG 
IFPPPPPGKGEKPPPPPFPGYIFVF 


4227 


18128 


A 


4251 


2 


219 


PSLRKMQKLARHGGTLLWSQLFGRLRRE 
DCLSSGGRGSQTHGSEL* SYHCIPAWAT 
DGDPVSKNTKFFFFFF 


4228 


18129 


A 


4252 


223 


324 


LGAVAH X CNP S TLGGRGRR I TR * G VRDQ 
PGQHGE 


4229 


18130 


A 


4253 


165 


336 


ES DAQGHQVAARVS HVLE KDALLVFRS L 
CKLAMKPLGEGPPDPK*ADSSSWPPSVQ 


4230 


18131 


A 


4254 


3 


233 


ETAFCHVGQAGLE LLTS SDP PASGLQHA 
GI TGPATKSAP * WS * VSGPHLGAFDSLA 
[V3LIiAQDWRALLESAKFFPLIi 


4231 


18132 


A 


4255 


394 


44 


AKIFPP GVKKF FAS TP P GGGKKRGP P P P • 
PVNFFPFKKGGGFPPWPGGV*NPAPINP 
RPWPPKRWGFKGEAPPPPPKLNLFFPQG 
GGGKNFFGKKKGAPTGGPPFFFFEMEFH 
CRPD 


4232 


18133 


A 


4256 


152 


2 


YWS PFYCRSCC IRQGTVAHTCNPSTLVG 
QGARFTRSGVRN*HGQRGESPR 


4233 


18134 


A 


4257 


404 


190 


AQL I RGWRQ * DHL S LGGGGCNE PS S CHC 
TPAWITEPNQSLSQKTQKQKMLIPN*YG 
*TVSPPKSHPELQFS 


4234 


18135 


A 


4258 


288 


407 


ENCRPRAVAHACNP S * ENCRPRAVAHAC 
NP S AFGGRGEQ I PRSGEQEQP VQHG 


4235 


18136 


A 


4259 


406 


163 


GWGVQTHPGHHGETLFFLKKKK*VGGGG 
PTRYSPIFGGVGPQNGLNPQGGGSKNPK 
LNPLLPPWGQKGVFFSKKKKKKERD 


4236 


18137 


A 


4260 


101 


276 


LTS I VP S LWE AEGAKLRGLGRPACRE S S 
LCHCTL P WVTG * NL VS KKKKKRGGALG F 
FF 


4237 


18138 


A 


4261 


2 


414 


WVAAT PNLVF VGRGNCQD W F TG P FLTRG 
GWGGVL F P P VKRV I KGLVAP FSGPIPIR 
KNHQKVGGFLEPNKTNFKE*NFVENKNT 
RRGI KKNVPEKKKPQKGI PLPPQI FNEN 
QTWGDFDFFFFAKKKKI INSFLGWA 


4238 


18139 


A 


4262 


337 


382 


RDARFLHCS LV* L I T * LQ KE FF F VGQAG 
VQWHYFGFLQPLPSGFNQFFCLSLLSRW 
HYRGPPPSLGKFLVGWLVGWLVFETKFW 
FCWPGWRAITGFWVPATYTFRVQAIFL 
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G=Glycine, H=Histidine, I=Isoleucine, 
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4239 


18140 


A 


4263 


406 


184 


QLLGRLRQENCHLGGEGCS *LRS PPCI S 
AWATKLDPVSEKPKTKNQKPTGLGGLNQ 
GPASLLKKIRKLGKKRPG 


4240 


18141 


A 


4264 


233 


440 


L P VHHGWGS LRKLT I LGKGE ANP S F FT W 
GQQGKAKQ KGQ KQ P GAGAHACNS S PLGG 
QGR* ITRSGVQDQ 


4241 


18142 


A 


4265 


2 


230 


GARL* S HLLi VRLtRKENHLNL KG KNCRE L 
KWPHCT PAWGT KKD S AAKNKNQ P L P P P K 
GVKSPPQREHGLIVTCMFVQ 


4242 


18143 


A 


4266 


1 


349 


HKTKIHFKLVMVSNVFTFFFFGKGFSFC 
APGGRAGPLFGLVETPPGREKAFFGPNP 
PRRGE * RAMAP CPGKI LVF * KKTGFPRG 
LNFWPGGPPPFGLQRGGMNGGNPWPGPE 
FFWF 


4243 


18144 


A 


4267 


2 


375 


R S E Q PAGS AS S GNEGLS TRAS G CGGC1G 
S PNS AS PPALCS I S RRALAAF P RGRARD 
LQPAMPEPPTHSVGSCAAKPPR*APPPA 
PRRPVPSTTQGLRSAGAQRGTGRHLHLQ 
PQCRIHWVKPAG 


4244 


18145 


A 


4268 


377 


3 


TPAWMTERDCIWRRRTSAPGGSWPSGPV 
PSPGAQ*RPPSQGLGLWWAAAAAPRC*T 
APGPRPPPHGPGSPQGASPPTRPPRCRP 
HPRAGSAGPTGATPPGSTQGQRRRHSHQ 
LPGHPGHRVALG 


4245 


18146 


A 


4269 


1 


294 


LEDWGGRGARAHYDGFSLEPESjDHYRLR 
LGQ YHGDAGDS L S WHNDKP F S T VDRDRD 
S YSGNWAL YQRGGCWTHACAQ * I LDAVY 
ATGP KYQVHCEALH 


4246 


18147 


A 


4270 


2 


218 


TGR 1 1 1 LS QGLQTLLPLI AF * * LLASLA 
NliALPPTIMLLGELSVLVTTFS * SMITL 
LQKKKKKKKKKKKI FF 


4247 


18148 


A 


4271 


32 


443 


LH S D VDQLAGL VF PG C P WPLAS PARRAP 
AGPWPRRAAAPP S * DAPAPRLAVS AGS P 
AWPPPST* GDPAPAAWAS PAS AS PTS A 
RSRS*ASPTARRCRPGPGTAAPTSASRQ 
CQWRP*PCQRSPPSGTCSSAAAAPT 


4248 


18149 


A 


4272 


434 


57 


HLS FPPAS AAPKPL PQAASLWS SPPSIV 
LAQVPPMNTDPCPQPSA*PASELSPEMP 
PARPQAPPENWSHP * G CGAQ T VAP L VPG 
ATPAHRPASPSPVTLPVLTAGGHCFYLY 
TYQIFLKFHYIKR 


4249 


18150 


A 


4273 


288 


424 


GLSLSVAQAWQWCDLVSL*PPPPRVKQ 
FSCLSPPRIWDY*HPPP / 


4250 


18151 


A 


4274 


1 


429 


NTRGAAGPPQMPHPPRASAFPENPCGRK 
N * GQVSGP * ASGS S P VKWAG PAGTWRKG 
GLGWGPTRVRGRGPRRRPGAS SGYARQQ 
GPGHPGFPPSPRRRLSVPACALCPGQTS 
GTRAGLAVWPQIGLCLKAQGARSCPRDH 
SSD 


4251 


18152 


A 


4275 


253 


3 


PHQFNAVIYFPSPVSRGCPNNALYSPFV 
I WL FCF VKTG VS VTQAGV* WHDHS S LQ 
P*TAGLKRFSSLTILSSWDYKSTPPCI 


4252 


18153 


A 


4276 


406 


3 


PMVSQGGVCAK* QARAI PREG I FRRSAD 
TQWREVQQKYNMGLPVDFDQYNELHLP 
AVILKTFLRELREPLLTFDLYPHWGFL 
NIDESQRVPATLQVLQTLiPEESYQVLRF 
QTAFLVQI SAHSDQNKMTNTC I 


4253 


18154 


A 


4277 


373 


3 


DGVSLCP PGWS AVAQSWLTAVL * ALGF I 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine 3 V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /—possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FtiFPVGGLTGIVLANSSLDIVLHDTYYV 
VAPFPYVLSIGAVFAIIGGFIH*FPLFS 
GYTLGQTYAKIHFP I IF IGVNLSFFPQH 
NLGLSGMRRCI 


4254 


18155 


A 


4278 


71 


176 


AGGRGCNESRSHHCTPAWATE *DSVKEK 
KKGLFFW 


4255 


18156 


A 


4279 


2 


379 


SPFVPLQSSLGNKSETPSHGRKKKRRKR 
KRTDEWQP*KHPLRARPGPGAPRAQGGA 
QGSGGGRRAGPCRKLVHRGHTGNPKREP 
GL I P QGEGRS LG I H P SNG CH KP CPS RGR 
PPPRPKQGRGRMQN 


4256 


18157 


A 


4280 


2 


338 


CDMPRRKLDPLSGRNTLGFVCWWAQTS 
GbK * S S CL CL P KC WD YRHE P LR P VWV YF 
KLWHPSPWKPGFYFSVFFPFYLPLCTTA 
SDFCSCLPPCCSRVTLVGSESSCLWPAG 


4257 


18158 


A 


4281 


407 


1 


E ERGVATGHTAE RGRADNE E E RGE VAN K 
RGGRE VQ P E ARRMTT TATGTTAATRGAR 
T * TAATTATAP VTRTAAP AT TAS STRTL 
RTPRRARTASVLWARCRSRWSATWRASR 
TCDCRTASHSRPTATRSRTARMA 


4258 


18159 


A 


4282 


367 


225 


P CDS ACIiGLPKCWD YRRE P PRPAGVLFL 
* AS F S * QS KE V YVC KLjTHA 


4259 


18160 


A 


4283 


452 


1 


NTCGLQSVCRGTQRGGPKRCPCHPRERW 
DLGDEQESSKEGRPGREGVQGQGLLGAT 
PGAQGSH*GL*AEAAMSSGHALGPGQVP 
LTPL S P S P PNPHRRPRAGRASRQSRES T 
EAQRAVPSQGAAPGWETDWGSSHQWQPC 
QAQQEGRTGR 


4260 


18161 


A 


4284 


436 


22 


CHVTGTQP I KVS WAKDSRE I RSGGKYQ I 
SYLENSAHLTVLKVDKGDSGQYTCYAVCT 
EVGKDSCTAQLNIKERLIPPSFTKRLSE 
TVEETQGNSFKLEGRVPGSQP ITVAWYK 
NNIEIQPTYN*EITFKNNCIAAARRI 


4261 


18162 


A 


4285 


301 


403 


LS IS * PCDIiPAIiVSQSAGI TGMSHHARP 
RVKSLI 


4262 


18163 


A 


4286 


380 


2 


AQVLYSSREQERRQDLPEQVIQAEGE*V 
KASACQLTFEDEEAMESGPAALDKDFQC 
TRKHHFAEVQGSPRCMSSRYLVDGCPKT 
FAEAQNVCS * CCEANLVS IHAFTFILRI 
QWCTSTVNQAQVCI 


4263 


18164 


A 


4287 


453 


3 


YIYEGS IMEEPRKPVPKGPLGLHCPGKF 
QG*RNSYNHHAVRVGTRCAPEGVKDLTS 
SLQSVITKPEQNIQELMKHFKKEKSEAE 
NHIRTLKAESLEEK3STMAKIHRGQLEKLK 
S Q CDRLTE E LTQNENENKKL KL K YQCL K 
DQLEEREDV 


4264 


18165 


A 


4288 


52 


400 


LDLYFFYRQCLALS PRLQRSSAILAHCN 
LKLLGSGDPPTPASQSKEITSMRYHI*P 
NL*NSMC*NCVFDTTHRXGENIWESYI* 
*EINIQHACRIPKTQQQKTHFQNKQGLH 
SNLN 


4265 


18166 


A 


4289 


2 


263 


IHGAHL WS QliPGRLKWE DRR S PGS RGCS 
EL * SHHCTP VWVTEGDPVSKKRINI CKQ 
NLNEKTNLTVDTPRHS FTHKI KS LNHFQ 
MFL 


4266 


18167 


A 


4290 


3 


142 


YLGVHRLSIDGLDLLT**SARLGLPKCW 
DYWRDPPRPANFYFILFK 


4267 


18168 


A 


4291 


324 


19 


EMSKRHEQKFHRRELQMANKHVKRTSLA 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, B— Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GJycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X—XJn known, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














IREMKIETTMKYYHTPIQMA*VRNNDTS 
QVQWHT LQVAS P LVAE VGGWNE ARNLRL 
QIVKMVPMNSHCTPAWVT 


4268 


18169 


A 


4292 


2 


204 


IHFCRDRDSLCCPGWPRTAELKQSALLG 
LPKCWDYRH* ATKPGLQSVFLLGLKI FL 
I LLVRGVDRYL 


4269 


18170 


A 


4293 


427 


3 


RRKTVTVPRTGERGI ISNEDNGLCKAVG 
NIPGSPLCNVSKLNIMKHAPGGHAARFC 
IWTESAFRKLDELYGTWRRAASLKSSYN 
LPMHKMINPDLSRILKSPEIQRALRAPR 
KKIHNRVLKKNPVLRPL* RI KLTSRHKR 
R 


4270 


18171 


A 


4294 


24 


372 


F I CPLQDYVI CS DVTS CWKKI LATTGR 
RLRDDSVQFSFFFFGKGGSLLAPRVKEQ 
GGDLG*WNPRPPRLREFPGLALPRCWNN 
GLAPPPPLILVFLEKRGFPLAGKMGLNL 
LHSR 


4271 


18172 


A 


4295 


1 


332 


IHSSYFFPQS*FFGTINTSDKPLRILIK 
EKKREGTN*HIMHEPWKFGIDPEDIIKV 
KKKYYKQLCTHKFDNLEEMVHFLKKKTI 
HL I * NR * FE * LYKY YRN 


4272 


18173 


A 


4296 


407 


30 


WSVIYGEN*ESLGWGAHPSHVANTRVTG 
LRYLFS YGRHALDMLES SQDNMRS W W VS 
QMS S E I D VDNLGH I S L CNAVQ * I RNLN P 
GLKTSKIELKFKELHKSKDKAGSEVTKK 
EFIEVYHELYAVG 


4273 


18174 


A 


4297 


350 


1 


YKTWMNKE K W * YVG YN I E Q E Q KLAL KT 
TVLDEWYTLPDGRI IKVGGERFEAPEAL 
FQPRL INVEGVG VDELL YSTIQAADVGT 
RSEFYKHIVLSGGSTMYPGLPSRMEREL 
KQHV 


4274 


18175 


A 


4298 


623 


1 


SRRGCAATCDGS ITAWPQRQAAQGKPS V 
HSKLEAEAKPTPGDHAASESTGFSCLGP 
GGVHKTAHAHARNMPGDSNTGSGQSPAG 
RRWEARGSAPRHHPTQPDWTHPDALRQA 
MARNPAS S F * S F * RCCTAAS P P VPTP P S 
PVLMRGPVPGGGGGQGKIRPLQEEAPPP 
SSPWLSR*PQAGTPSSPAVSSLYHGGL 
SPTGRQDRWGR 


4275 


18176 


A 


4299 


469 


3 


PVRNCLGARFRVSGRAAHHAL* QSAS TL 
QGDPRTKRQAI SA* S SAFDVQDVSHVTL 
PFYPKRAQSKDLIKEAILDNDFMKNLEL 
SQIQE IVDCMYPVEYGKDSRI VKEVDVG 
S CTYVMEDGKVGVTKQGVKLCPVGPGKV 
YEELAILYICRRPCI 


4276 


18177 


A 


4300 


506 


321 


KPhSLhKIQKhAGHGGSCh * LQLLRRVR 
QENCLNLGGEGCSGLPKFWDYRCEPPCP 
DCWPS 


4277 


18178 


A 


4301 


454 


1 


NGEKPTYSGKKYVFLF*NTRRPTLFTWG 
KTN*GERPRSSVYSCSSWRLSSSSLGRS 
*NSVGSLNI ILATLTAQELSNS * ANNLI 
* LAYT I AF I VKI PL YGLHL * LHKAHVE V 
P IAGS I VLAPVLLKLGGYGI IRLTLILN 
PLTKHIAYPLY 


4278 


18179 


A 


4302 


3 


463 


AVSLPLPLPRFGRLPPLRVGWCPLRCH 
ILRTQRLEGAPSR* Q TGAKQ AGVWE ELR 
SRLSLGPELGRIRPYSGDPGQVTDSSGA 
LAS S LERG C YP AALL VGEGE GEMYLGP Y 
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26 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K-Lysine, D=Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
Q-GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrostne, 
X-Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 














HPT*DGLKRLSLAKPPGGETRLTRSGHT 
FWGEGIPPSAVCL 


4279 


18180 


A 


4303 


469 


2 


FFFFFSETESCSVTQAGV*WHDIjGSIjY 


4280 


18181 


A 


4304 


194 


3 


FFFFFSETESCSVTQAGV*WHDLGSLY 


4281 


18182 


A 


4306 


517 


254 


WEETAFRLIGKTGIGLVTSGDIAASAS 
QRGGMTGVSHCNRKEFVLMNI I VMSSGL, 
TFLPFSPDEV*GS*GLSFLPSPLALNTG 
LRN 


4282 


18183 


A 


4307 


276 


3 


KIKMEGIPLHIPNPLWLNLGLLFILAT 
SSLAVYSIL*SG*ASNSNYALIGALRAV 
AQT I S YE VTLAI I LLSTL IIS GSFNLS T 
LITTLV 


4283 


18184 


A 


4308 


234 


398 


KKFF FFPPLEGGGP ITT I WS P PLPG * RE 
S PAP P P P RGG I KGLiAP P PNL F F FLDKR 


4284 


18185 


A 


4309 


403 


100 


SGGQKAVGP P WAAP PGYKQNTNKLGPLP 
QPSTEGGAFWPTQGPKPTGLPPLPG*LN 
PRNPTPP*WPPPPPPFNTRSKGPHTPSP 
TPLCGPPPKNPRLFFFF 


4285 


18186 


A 


4310 


399 


85 


WKP I P LAPE WRG V I RFRAL P P CRPLHCP 
TLKLTAPSVIHQRT*VHWGFLVGVSLSV 
FLERESCSFTQVECSGTKLALRSLELLG 
VSDPSISAFQRAGIAGVSHHA 


4286 


18187 


A 


4311 


230 


2 


WTEEDTRRCLVLFSFFLSFFLFCFVLRQ 
RPAGC PGWSAVTQS QLTVAS TS IiAQAKR 
SSHLSPPQRS*DYRHTPQCI 


4287 


18188 


A 


4312 


1 


420 


NTS WGVGELSL IVI VINMLLP YVWLPKG 
RFFFPQNSRFPSAPPHSSPPGLRSDFSH 
SGGL F FHLE VLWGIj PL P P P P P PAS THVR 
RPLGTQGCP * LAWYIHLI S AS YQKANAA 
PQLSCILQDCIRSKGDILIFTFFTLCLS 


4288 


18189 


A 


4313 


403 


207 


ESSEG*IiNPLAHIiAMRYKGCPFKDVREK 
SEFILKS IQVRKS ILNTRMSLLQLDFTC 
RYSRCVCFY 


4289 


18190 


A 


4314 


469 


328 


TSE GGGFNELRL CHCTP VWL TK * DS VS I 
SQRIRKKNLKTQSLWRCP 


4290 


18191 


A 


4315 


1 


212 


NTLPGVE VLKRLRWEDSLNP * GQGCRE P 
* SCHCTTAWATEKDP VSKKKKKKKNPGG 
FRPHSLSQPFGPPK 


4291 


18192 


A 


4316 


63 


559 


SNLTFWQCAVPAVETTPSSLCGPVPSPT 
SSATPKPIPS *AACPPPDCALAAEVRAIi 
PAAGRPRFSEACLTPQPNPCVLWPCLWD 
HS PTS CPTGS S * APHSCAFLPVPRLGGG 
MTTVHPTVHLP IHSSLTFAWE * PSPPAC 
LTAPRPTGHCLLSVPPSAYSSSSPS 


4292 


18193 


A 


4317 


135 


405 


PAPPS PHPKPDS VS CVIPPS * PAPPSPH 
PKPDSVSCVIPPSHVAGPSGLPEMTLLE 
PRGPCPPEVPPTSPFVG*AWWPHPPARR 
ASGRMDGRTDGRGRHLDLRSTHS FLTP 


4293 


18194 


A 


4318 


413 


1 


LYLVGRA* SFHRSFPGQERIHPGGKPYD 
CKECGETFISLVS IGRHMLTHRGGVPYK 
CKVCGKAFDYRSLFRIHERSHPGEKPYE 
CKQCGKAFSCSSYIRIHERTHTGDEPYE 
CKQCGKAFS CCK YI R INERTHTGGV 


4294 


18195 


A 


4319 


479 


342 


GVGVDNDGILVLGATNIPWVIiDSAIRRR 
* DNTVNI LFLTADVTSR 


4295 


18196 


A 


4320 


400 


26 


HS EAVTT VRSHH S P VGH I HS LNAP P TVA 
IiTP * YNKP LN I F V * ARAQK I Q GQ FLTHT 
IPHILLLVITRDTSRACVCVCVCVCVCV 
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NO: of 
peptide 
sequence 


M 

eth 
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NO: in 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, A=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CVYAYJWVCMCVYIYVCVYIYICNYTFI 
PTLVHLPINKMY 


4296 


18197 


A 


4321 


370 


1 


S DRGQGKAGQATRLGSRQTGQDEDKGTE 
KSIPS WDKGP VNNE FGKS VNVS SNLVTQ 
E PS PE E TS TKRS I KQNSNP VKKE KS CKC 
NECGKAFSYCSALIRHQRTHTGEKPY*C 
1STECEKAFSRNV 


4297 


1S198 


A 


4322 


342 


82 


GLGLTTN WLLTGKS AGRGCAP S S GGL WLr 
AVGDS VYYHRETEGGKPS PGRP CC * p S * 
AGIAPPSPPPPGCPEAPGSCCLHYRCLL 
TAH 


4298 


18199 


A 


4323 


284 


3 


I TKKFPRVI GGGCF S P FFMGLS KKNGL F 
P EGALS ISLIFGPAI PPGGAQVI SSAKE 
KKKVLSRPGWAHACNPSTLGG * GGR I K 
RSRVPDQPV 


4299 


18200 


A 


4324 


365 - 


3 


QVEVYWENHPFRIiEESIYQ* CRLEGAT 
SVAGEQISEYlSriSMRASDGGSPPLiSTET 
HITLHVIDINDNPPTFPHLSYSAYIPEN 
NPRGAS IFS VTAQDPDSNNNARI TYALT 
EDTLQGVY 


4300 


18201 


A 


4325 


253 


3 


PSFLSRD*SHIHKRIiECSGMISADYNLH 
LPGSSYPPTSPSQVAGTAGPSMSLQEGR 
ASHDITSSSRS IGAKELLRPADHPQC I 


4301 


18202 


A 


4326 


307 


95 


FLFLLKDKI LLS PRLECSGT 1 1 AHCSLK 
LiLDSS YPPAVAS * VAGTSGMCHYTWLRW 
KNHLSPGIQGCSEI 


4302 


18203 


A 


4327 


371 


2 


WAPNHISPTPRSGSXTRIWAPTPPSNSP 
RPCQDPGHRTDPWYPEEEFLLTNPDPPR 
APAS WS FPFQE RRLHFPALP CP * HLDS S 
LGPTTLFSFSFPPTSIRPSQTNHSKGTP 
P PIiS YAQHDCI 


4303 


18204 


A 


4328 


409 


1 


RLLEARQPELEMAAL I FFTLKYKHVERE 
QKYHQLQDE YFTSAWLTL I LAALFGLV 
YLLIFPQSVWLLLLVLCICFLVACVLY 
LHITRVQCFPGCLTIQIRTVLCIFIWIi 
I YS VAQGC WG * L P WAWSS KPNL Y 


4304 


18205 


A 


4329 


262 


2 


LHGAYLVLDI IGAQNRKPRYS FKWRVGQ 
FFFLVFSPRDSLTLLPRLECSGTVMAHC 
HLDLLGSGDPPTSAS * I *GTTGAYHHTQ 
HV 


4305 


18206 


A 


4330 


1 


399 


NTF*FLKGMGLVNHVFTEDNLKKIiYVSN 
LG IGHTR YATTGNV 


4306 


18207 


A 


4331 


408 


3 


SSVGIHVRTHTGEKPYECKHCGKAFSCH 
SSLREHVRTHSGEKPYECNQCGKAFSHA 
QYFQKHVRSHSGVKPYECTECGKAYSCS 
SSLRVHVRTHTGERPYECKQCGKTFRYL 
AS LQAHVRTHAGA* IYKYSGHV 


4307 


18208 


A 


4332 


110 


426 


GLS PGT WSDMTGGPAVTAFP CTLTGTHA 
GHI I IDGID IAKLPLHTLRSRLS I ILQD 
PVLFSGTIR*APPPLRPTQPQAGSVPSD 
LEHKE E G VGGDQD P RGVS CSC 


4308 


18209 


A 


4333 


133 


1 


EPCCPSALAAPEVLGPEKYDKSCDMWSL 
GVFMYIL*VPSPPPLY 


4309 


18210 


A 


4334 


165 


1 


1 1 LTFFKKNERKWKAVQYW* KNR * VDQ 
WNRI AS P E I D PHE * S QL I S D KGAKE C I 


4310 


18211 


A 


4335 


42 


440 


SARRAGDPARGAPSRNNASLPDPRELTG 
RPAGGLQ*GNCPAPPTPQLPSPSVSRPV 
SGRGP P P P PS FGDPRAWRPQNPKGAKW 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=Iso leucine, 
K— Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine 7 
X=Un known, *=Stop codon, /=possibie 
nucleotide deletion, Y=possible 
nucleotide insertion 














LTVQMGEPGE IRRAACPQLSWAMTFPGS 














LPEPKRTGPQIQKSPPSRRGG 


4311 


18212 


A 


4336 


1 


160 


NTCGGGAQL* SQLLRRLMWEDLNSGSQG 
CSELRSHHCTAA* ETKGDPLS INEY 


4312 


18213 


A 


4337 


236 


1 


QRSLLCYIMEIRTVAVRIVAIKGVESEV 
YLAMSEEGTVYAKKECNEYGIF*ELV1;E 
SHYYTYAAAVLRPVCCIESKVS 


4313 


18214 


A 


4338 


408 


3 


CQSSVSKKERTNGAQNPGAAKQGNNELR 
DSTEQFQEYYRQRLRYHQHLEQKEQQRH 
I *QQMLLEGGVNQEDGRDQQQNRNEQFL 
NRSIQKLGELNIGMDGLGNEVSALIQQC 
NGS KGNGSNGL* VNS FDTP PHV 


4314 


18215 


A 


4339 


363 


1 


RSQSSLPKSFKRKI S WSATKGVPAGTS 
DT * G VQPGWQ * RWGASTATTQ KKPS I S I 
ATESLKSLIPDIKPLAGQEAWDLHADD 
S R I S E D E TERNGDD KTHD KGL KI CRT W 
RARYSINEV 


4315 


18216 


A 


4340 


1 


353 


DVFLDTLARPLRHS CNFFLCYLL*DFFK 
FIPQFFYWMFYFDYIFISRRNFYFLLLT 
HFFTLPLFLVIRIQCLHLFLFFSFFFFF 
FGKGNP FDPPGWRARAQFWVNGS PPLRV 
NALLP 


4316 


18217 


A 


4341 


376 


3 


LELREGGFLPHIADEVREERS PALLDDR 
AGRCQGQPRIQVCIiTPKSMLFLAFHM* T 
CEHCLEQCFSTSL*PIEIRTLH*D*GCV 
CMCAYVC I CVCVHMCMCAY I C I CVARMA 
KRPLESIHTSCT 


4317 


18218 


A 


4342 


158 


2 


LAFFFFCETQSCCQWHDLSSLQPLPQRF 
R*FSCLSLPSSWDKKHTSPHPTCI 


4318 


18219 


A 


4343 


145 


2 


I FGEQ WFDYMNKFFSGDF * DFGASLTQ 
AVHTVPNV*SVIPCHPPRV 


4319 


18220 


A 


4344 


1 


434 


RSLIFRATAYEYRYIDQADLKLPASKRS 
S CLGLP KGWD YRHE P PCLARF I F WQ I F I 
AYKLR* KHYKSRQRSLPSRGSC* CACDT 
FVSD P SHWGTMAQ I LGGW I TLGQVQRS C 
LADQPLLPLLPPTliAPCLP *NDVLLFFF 
TQSQ 


4320 


18221 


A 


4345 


4 


475 


KHS CRS LEHLDR P L P PALQE T CP VRAE P 
IjLjL VR I NAS GGL I LRMGA INR CLKHPIiA 
RDT P VCL1I1AVLGE QHS GKS FLLNHIxLQG 
LPGLVRAGRGRREAGSKDGGSCLGEAGS 
GIPVCGDKEPTN* CSPFSPLQESGEGGR 
PRGGEASLQGCRWRANG 


4321 


18222 


A 


4346 


313 


442 


DSTLNFFFFF*TESYFVAQAGVQGHNLS 
SLKPPPPGFKRFSCL 


4322 


18223 


A 


4347 


437 


338 


AEVQEFYGDYIAVNPHLFSLNILGCCQG 
RNWDPAQLSRTTQGLTALLLSLKKCPMI 
RYQLS SEAAKRLAE CVKQVI TKE YELFE 
FRRTE VP PLLL I LDRCDDAI TPLLNQWT 
YQAMVHELLGINNNRIDLSRVPGCMRPL 
*RIHPGTLERSIRLLFMPSSSWTMA 


4323 


18224 


A 


4348 


409 


118 


GLHDAS FVAAI PS TQ WYFFFE ETASE F 
DFFERLHTSRVARVCKNDVGGEKLLQKK 
WTTFLKAQLLGTQPGQLPFNVIRHAVLL 
PADS P TAPH I YAVF TS Q W * AAGP S GRWS 
TS S AAAFRRPRHS CRL 


4324 


18225 


A 


4349 


2 


418 


YTLPDFPHHPHRIiSHCRLLQSRTAETYQ 
SPGACTPISQGHPGQRSSPCSSSSNPAC 
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beginning 

nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R-Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan ? Y=Tyrosine 7 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














* S S S S CGSTGFRCPGSAHPGCCS YVL * M 
GR*RS SQAPRAS P I AQGPRSAPAGRAGH 
PALPSPLP*GAPPPPSAASSPGL*SPA 


4325 


18226 


A 


4350 


401 


276 


DPLNVGGGGYS * PRLCHCSPTWVTEGDL 
VSKTNTHTOKKHL 


4326 


18227 


A 


4351 


446 


277 


AQDFVLIKDGEVIGGICFRMFPSQGFTE 
IVFCAVT *NEQVKVS AAPRPQ P QSTAHS 


4327 


18228 


A 


4352 


408 


3 


PKSSLFTSRPNGKSWSTKRPAISLAYSN 
ENAQGSWNGDQDAGPLLVENPRDYSTEL 
SVTIAVGASLLFLNVLAFAALYSRKDTR 
RQ* PLCQLSPHRGIGASELGHAPQ* ELP 
ALLLGPIRHECEAGPPHDTLPV 


4328 


18229 


A 


4353 


62 


280 


VRIGCLTSVPEASSCGLPTTPGCC*PLG 
TLPCRSRSRTRSFRSRSRTQSRSRPKTY 
SPGRRRRSRSRS PTPP 


4329 


18230 


A 


4354 


364 


3 


PVGEEGKSLVPKSPVEEKGKSPVSQSPV 
EE KAQYRVP KS PVEEAKS KAEVGKGDQK 
EE *EKEVKQAPKEDKVEKKEEKPKHVPE 
KKKAESPVKEEAVAEWTIAKSVKVQLE 
KETNEECI 


4330 


18231 


A 


4355 


3 


558 


LGP TLLWRRGKVP KANS P I PTS LRRTLP 
RASPGPSRGPLFMRQAGSTPMTLPRSTP 
SMS PML *MVWPPTAVPVP * KPLMWAPPA 
PL VL L VT I LTE I Q E PAT PAP L TQ F * KP T 
SLMVSRPVCPWQGPRTTRSTLCATTIA 
PSHATLRPGLSTTTSPLWQTLSLLLEGQ 
ASLPKG*NTSITLPSVS 


4331 


18232 


A 


4357 


258 


1 


LGFHDPGPAGWSQPHGGRSAQAQ*PPQP 
PT VP RT S LAD VS DAP LP P CGH S RPQHH P 
FRLPHTSFSTPGLSAVCLIVHPAARVSR 
CI 


4332 


18233 


A 


4358 


97 


343 


DRDLGDEYGWKQVHGDVFRPSSHPLIFS 
SLIGSGCQIFAVSLIVIIVAMIEHLYTE 
* VP TLNC I MS L L FLNS FNHWNL L S Q S 


4333 


18234 


A 


4359 


182 


2 


KDYFGSSVENGGEEELKQEDQLGGYSVI 
*ARDDEDEDDDDDDDDEDEEEDDEDEDD 
DDCI 


4334 


18235 


A 


4360 


88 


2 


RGS RTD TG * RGS DRGTG * RGS HTGTR * R 
GSRTDTG*RGSRTDTGGRGSRTGTGGRG 
SRTGTGGV 


4335 


18236 


A 


4361 


2 


250 


ELRDEGKASSAKQRLKCAGLQKFGERAF 
KAWAVARLSQRFPKAEFAEVSKLVTDLT 
KVHTECCHGDLFGCDEESAGLAKYIM 


4336 


18237 


A 


4362 


169 


456 


EQLLCAGQGQLS CEPLHLPS CP VGPQTP 
AHRP GRMAPGP P S F L S PGS P CGLGLAI P 
CSCQP WVAGVPDHHLFPS PMGRAGLS PS 
ERSAGEGVIPWP 


4337 


18238 


A 


4363 


1 


419 


PEFKLQKLKRSQNSAFLDIGDENEIQLS 
KSDWLS FTLE I VI ME VQGLKS VAPNR I 
VYCTMEVEGEKLQTDQAEASRPQWGTQG 
D FTTTH P RPWKVKL F TE S TGVLALED K 
ELGRVI L YPTSNS S KS AELHRMWPKN 


4338 


18239 


A 


4364 


375 


448 


. PGAVAHACNP S TLGGRSGRI MRAG 


4339 


18240 


A 


4365 


1 


444 


DFLTNHYLHFLR IAGSQLTGLGTAVQLY 
SAYEENNRTFLLAAVKRNHNQYVNPSGV 
AT F FE S I KE I LLRQ S GVKVE S VDHDS C V 
HGP CQNGRS CLRKLAVS S VL KS RE S LP V 
IIVANEPLQPFLCKCLPGYSDSWCEIDI 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R-Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














DEWLPFPW 


4340 


18241 


A 


4366 


52 


402 


CPQEVWFHS FCHCTSAFMKDLYFLCLVR 
FGDLKEDKVTRHDGAS S DGHLAHI FRHA 
AKE L FNE DVE E VT YRALR CGAVS TACL C 
LWLARVHGVPVWI CACGGSMDGVYTSQN 
QKDFF 


4341 


18242 


A 


4367 


119 


247 


WVLLS FQLHGVAVLGI ALI AMGEE IGAE 
MALRTFGHLLRYGEP 


4342 


18243 


A 


4368 


2 


430 


GGTWDGQPGFHGDSLSKAPGMHSLEQG 
MVGLN IGDVS S SAVKTVGS WS S VALTG 
VLSGNGGTNVNMPVS KPTSWAAI ASKPA 
KPQPIMKTKSGLPPSPIKHNMDIGTWDN 
KGPVPKAPVPQQAPSTQACPQSQQVGQT 
LiP 


4343 


18244 


A 


4369 


1 


406 


ETSTPEGEAGPIQRLDIPVENPVESKNI 
FLGAP L 1 1 CHVI DKRS PL YD I S ATDLADJ 
QDLEVIVILEGLGETTGISTRARTFYIA 
EEIQWGHRFVSIETEEQRPDSVDYSKFG 
KTDLVTTSRCNARELDEKPSILI 


4344 


18245 


A 


4370 


2 


284 


GTGTLCDLTALLSARYDGVRTCILiPCWK 
TISTI PVALL IH YHQNQAS CGKRAI I LE 
TRQHRL F CAD P KE Q W VKDAKP HLDRQAA 
ALTRNGGTFE 


4345 


18246 


A 


4371 


3 


184 


EDYNI LLS IMKGTTRP INKE I E ALNDT V 
DQL EIiTD L CRTLT PT I AE YS F FS SAP KV 
LiIQP 


4346 

t 


18247 


A 


4372 


44 


304 


GLPRSIGCQGLPRLPEWPTWVGGCHRGL 
IMPSVPTQVSERPLMFLiLDTPGVTjAPRI 
ES VETGLKLALCGE PGLGLGLGPGPLP P 
HLt 


4347 


18248 


A 


4373 


1 


422 


lhlfnpaasapsrslfsgpildplsral 
gvgpgicglasspgisegwdqirswthp 
pdpdhlsgfcrsqvymhpslsppstmil 
sggtalkppysafpgmqplemvkpqsgs 
pyqpmsgnqalvyegqlsqaaglsasqm 


4348 


18249 


A 


4374 


2 


423 


NSGTHT PGLLEDLS KNGRL PE I KLPVNG 
CSDLEDSFTILQSKDLKQEPLDDPTCID 
TSETSLSNQNKLFSDINLNDQEWQELID 
ELANTVPEDDIQDLFNEDFEEKKEPEFS 
QPATETPLSQESASVKSDPSHSPFAHVS 


4349 


18250 


A 


4375 


292 


429 


LKISVFPSSTEISLQQRLEYIARAILTA 
KS STG IAS I S ADGE S LRE 


4350 


18251 


A 


4376 


544 


32 


ALHACACLGLSPEACFLHLLQLTGTSPG 
PVDGQGLDNQGFRLLKAILGVSPAPEPV 
HGAS E HNAD TG VS VARAC VL P T S PTLAR 
TAAAKDESPPLAGLTMGPSREPGGSCLP 
LPSGLLSSPTRLALPRRPPSVSVWQDAG 
SSGSMPTEPLVQCGHRFYGTLVSWAVMV 
LK 


4351 


18252 


A 


4377 


378 


232 


PAKRAEEELLLHDTRCWLNGGAMPEARH 
PRTGAS ALHVAAAKG Y I EVMR 


4352 


18253 


A 


4378 


291 


1 


QMLSVDVASRYRAPSTYVLiNSLKEGMDG 
LHGESCSSFLLGPSVAMNMQTAGLEMGI 
CDGHFRQNVGCGYVLKPDFLRDIQSSLH 
PEKPISPFKGQSL 


4353 


18254 


A 


4379 


171 


458 


RGPLFLTLHIVNSLQGNNFRREYIVTQG 
PLPGTKDDFWKMVWEQNVHNIVMVTQCV 
E KGRVS KQL S WHQ S FAF CHLDR I L PHHQ 
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Amino acid sequence (A—Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny)alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine ? 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














SETPQNNVGIM 


4354 


18255 


A 


4380 


214 


410 


LDHLPTHPSPTGE IEVKWFEAVQTGLPM 
CILGAFFGPIRLGAQSLQVLDSELIPWA 
VQNGRIAPC 


4355 


18256 


A 


4381 


2 


74 


IGDSGVGKSNLLSRLTRNEFNLGL 


4356 


18257 


A 


4382 


290 


119 


I ATVS LNI VKMPRLPKAI YRFNAIP I KI 
PMTFFTEIDKTILKCMWNHKI I PGNSPA 
H 


4357 


1825S 


A 


4383 


2 


423 


LERVCWIKDI WAVTGENMEVMKSI IQK 
YQHKRI SLVEAGVTRHRS IFNGLKALAE 
DQINSRLSKPEWIIHDAVRPFVEEGVL 
LKWTAAKEHGAAGAI R P LVS TWS PSA 
DGCLDYSLERARHRASEMPQAFLFDVIY 


4358 


18259 


A 


4384 


356 


1 


WYPCCLLASLI CVFRGLFYLSLCLVLRV 
PVPLCLQFPGSLLGSLLLFSLLRQGLAL 
WPRLECSSAIRAHCSLELGSNDSPASAS 
QSAGITGMSQCAWPLSLLSVSPRVPGLS 
SVPLIF 


4359 


18260 


A 


4385 


1 


125 


SGSKKTLRSSLEEPTILDIEKFHRESFF 
YTHLINFSGKRYC 


4360 


18261 


A 


4386 


53 


215 


WQLYWTTWCQKI SRNQGLAHVPRME CS S 
MIIAHCNLKLLGSSDPPASASQIDPS 


4361 


18262 


A 


4387 


2 


390 


P SLAARVLARG YGNFSVFVIiWLRKNDRM 
HGE CAPNVS VAVS T S HT T I IGGGIRGGG 
GGGYGSGGSSYGSGGGSYGSGGGDGGGR 
GSYGSGGGNYGSGCAGSGHGSYCSGSSS 
GGYRRGSGGFGSOSrSGG 


4362 


18263 


A 


4388 


2 


441 


I KTR IS VI HKEE FGLTP I EGA I EDMKNK 
TLQLAFA I NQ EPS D AKMLQM VLKGSEGA 
TVNQGPLE VAQVFLA.E I PADP KLiYRHHN 
KLRLCFNEFIMRCGEAVEKNKRLITVDQ 
MEYQQELKKNYNKLKENLRPMIERKIPE 
LVKPIF 


4363 


18264 


A 


4390 


242 


24 


KLGNFLGGCVKESPFVPQGGMQWPDFGS 
LQHPPSRFKQFSCPTLPGNWMFRNAPLR 
SGKFFFSPVFLVETGY 


4364 


18265 


A 


4391 


1 


244 


VDQQMRQNIjFTTGAHHLQQANI QFRTD I 
ARTEYLSNADERLRWQASSLPADDLCTE 
DAIMLKRFTRYELGCQGEAWEECGW 


4365 


18266 


A 


4392 


2 


146 


LDLLSQPCRAVYIFAKKNDIPFELRIVD 
L I KGRSS LGFGE PKS QEGDR 


4366 


18267 


A 


4393 


204 


489 


GAASEHP I PKGESHGTSKAS S LCADAGP 
AGHFASVYNPLAWTVTTIVTLTVGFPGV 
RVTD E AGH P VP S Q VS G I QRCLQ GMKAP P 
VTVTVSGEKGS 


4367 


18268 


A 


4394 


2 


171 


LSDGFQPSRSIIFASWSAGDFGSVGATE 
WLEVFFIIPSHIGHELVGLGCCPEVMFL 


4368 


18269 


A 


4395 


2 


197 


WLELLKFYTLDCAIiEEYVICVRIQDILT 
RENNNWPKRRIAI EGEMI CC I S FVEYF I 
CATHYL I QC 


4369 


18270 


A 


4396 


2 


285 


TVDFPFQPYKCQQEYMTKVLECLQQKVN 
GILESPTGTGKTLCLLCTTIiAWREHLRD 
GI S ARKIAERAQGELFPDRALS S WGNAA 
AAAGDPIGPS 


4370 


18271 


A 


4397 


366 


461 


TSNFS CSLVLEEEASDYLELDT I KNLVK 
KYSQ 


4371 


18272 


A 


4399 


157 


3 


DVLRNNFARSAVLYLFIDRVTLWNAPRL 
ECSGAI IAHCSLKLLGSNDPPTS 
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Amino acid sequence (A==Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GJycine, H=Histidine, I=Isoleucine, 
KHLysine, L==Leucine, MNMethionine, 
N=Asparagine, P~ Proline, 
Q=Giutamine, R—Arginine, S=Serine, 
T=Tnreonine, V=Valine, 
W=Tryptophan, Y-Tyrosine 7 
X=Un known, *=Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4372 


18273 


A 


4400 


2 


343 


DIDFKYHFDFFSVNFNEELVALYGGSLQ 
KQTKF VHE CI KT I LKLYKGQE FAPKS VA 
I IGHSMGGLVARALLTLKNFKHDLINLL 
I TQATPHVAPVMP LDRF I TGSTGWLKDH 
G 


4373 


18274 


A 


4401 


214 


430 


E LGN I LN YWRWE AE AEL SNRW FNGQAV 
HGELS P VTDFRE S C CRQYEMGE CTRGG F 
CNFMHLRP I SQNLQRQ 


4374 


18275 


A 


4402 


3 


227 


LiTQVS PQMTGHAGLNTAQAGGMAKVS EL 
KHFQ Y FLi PNRGL S LR VMRKMG P RGPLL F 
FLLGSPRATHAWKRQISF 


4375 


18276 


A 


4403 


158 


313 


NRDEFCHVAQAGLELLGSSSPPAATSQS 
ACITGVSYHVRPTLSNKFGCSSL 


4376 


18277 


A 


4404 


3 


628 


HCIREGGQDVPSNKDVTSLDWNSEGTLL 
ATGSYDGFARIWTKDGNLASTLGQHKGP 
I FALKWNKKGNF I LSAGVDKTT I I WD AH 
TGEAKQQFPFHSAPALDVDWQSNNTFAS 
CS TDMC IHVCKLGQDRP I KTFQGHTNE V 
NAI KWDP TGNLLAS CSDDMTL KI WSMKQ 
DNCVHDLQAHNKEIYTIKWSPTGPGTNN 
PNANLMLASASL 


4377 


18278 


A 


4405 


35 


180 


MCVDYHYFFCLLGPJSnSnSTNPQTSAVRTPT 
QTNGSNVPFKPRGREFSFGK 


4378 


18279 


A 


4406 


1 


438 


DFQRPRDDHGDVDWEKLVELLTDCSNLQ 
DQT Y I L Y I L YV I KG P S WD TNL S GQNG VT 
VQNLLGELYGKTGLNQEWGLIPYISSLF 
RKKVD VIiAE ACTD LLS DQ KQ LT VGLS DE 
PREKIIYAPLPPKELTKLIYEASGQDII 
IGVLTQ 


4379 


18280 


A 


4407 


1 


428 


TE S VNAYFKGAD P TKCI VKI TGDMTMS F 
PSGIIKVFTSNPTPAVLCFRVKNISRLE 
QILPNAQLVFSDPSQCDSNTKDFWMNMQ 
AVTV YLKKL S EQN P AAS Y YNVD VLKYQ V 
SLNGIQSTPLNLATYWKCSASTTDLRVD 
YK 


4380 


18281 


A 


4408 


174 


383 


KNS W PGAVAHACNP S TLGGKGRQ ITRSL 
YRDHPVQHGEIPLYLKLTALAVTFLGLL 
TGLHDLYLTTNLT 


4381 


18282 


A 


4409 


1 


288 


LMAEKDS LDP S FTHAMQLLTAGKGASRV 
PLGRPAI PGMSGPGFVPLASRPLGTEAV 
GSSGLILIFGSLDSTFPPQPGVRGRISR 
LQWRPEGDFAIP 


4382 


18283 


A 


4410 


292 


422 


LVHSSCFQVSSKILELARKQRMNTDIRR 
NIFCTIMTSEDFLDA 


4383 


18284 


A 


4411 


412 


238 


FFSRHRVSQWDQAGLELLTLSDLSALAS 
H S AE I TGLNHHAQ P LT TNFNKY CW I S LQ 
VH 


4384 


18285 


A 


4412 


1 


405 


VTSSDKSLKVLLDAEDKVFNEIRNEHFS 
NVFGFLS QKARN LQAQ YDRRRGMD I KQM 
KNFVSQELKGLKQEHRLLSLHIGACES I 
MKKKTKQDFQELIKTEHALLEGFNIRES 
TSYIEEHIDRQVSPIESLRLMCL 


4385 


18286 


A 


4414 


2 


149 


GFHHVGQAGLEVLTSDNLPVSASQSAGI 
TGVSHCAQLRSDGFKKRSSPA 


4386 


18287 


A 


4415 


1 


366 


D I CHRKLGVE CPSHAS I NRLWQWS S I 
TASLRFEGPLNVDLIEFQTNLVPYPRIH 
FPMTAFAP I VSADKAYHEQFS VS D I TTA 
CFESSNQLVKCDPRLGKYMACCLLYRGD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, ENHistidine, I=lsoleucine, 
KHLysine, L-Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu taurine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














WPKEVNAAI 


4387 


18288 


A 


4416 


I 


164 


RRRCEMSTMFADTLLIVFISVCTALLAE 
G I T WVL V YRTD P FGTLAG S E G P GGRY 


4388 


18289 


A 


4417 


2 


141 


DIAI PCNNKGAHS VGLMWWMLAREVLRM 
RGTISREHPWDLTSSVMG 


4389 


18290 ' 


A 


4418 


136 


1 


EQTKAFGQLHKVLGMDPLPSKMPKKPKN 
ENPVDYTVQIPPRDPMQ 


4390 


18291 


A 


4419 


1 


280 


GNYCKYYGYRGPS CEDGRLRVLKPE WFR 
GRDVLYLGCNVGHLTLS IACKRGPSRMV 
GLDIDSRLIHSARQNIRHYLSEELRLLP 
QTLEGDPGA 


4391 


18292 


A 


4420 


3 


446 


TVS TS QVRQNYHRDS QAAINRQ I SLELY 
ASYVYLSMPYYFDRDDVALKNFAKYFLH 
QSHEDRDHADKLMKLQNQRGGRSLLQDM 
RKP D CDD WE S ALNAME WALHLD KNVNQ A 
LLELHKLTTDKNDPDLSDYIETHYLNEQ 
VTAINELG 


4392 


18293 


A 


4421 


6 


182 


DPSFCSEPRSCHCTPTWRTEQDSISEKK 
KKRUI LKNWAKN Y I TLE SEKERKSKQRS 
RR 


4393 


18294 


A 


4422 


1 


456 


GPISYVWMPKKRQALVEFEDVLGACNA 
VN YAADNQ I Y I AGH P AF VN Y STSQKISR 
LGDSDDSRS VNS VLLFT I LNP IYSITTD 
VLYTI CNPCGPVQRI VI FRKNGVQAMVE 
FDSVQSAQRAKASLNGADIYSGCCTLKI 
EYAKPTRLNGSK 


4394 


18295 


A 


4423 


2 


257 


FHHVGQAGLKLLI SVDPPALDSQSAE IT 
GVSHCAQPVLCILNQLCAKAVLSPDSHP 
YPPKSGRHEGQVREQKMSPCSYIYFKRT 
V 


4395 


18296 


A 


4424 


106 


259 


CCCFKYFFSNVKI CFYRDLKPENILLNE 
DMHIQITDFGTAKVLSPESKQGV 


4396 


18297 


A 


4425 


3 


364 


MDE IEKYQEVEEDQDPSCPRLSRELLDE 
KE P EVLQDSLDRCYS TPS GYLE IjPDLGQ 
PYSSAVYSLEEQYLGLALDVDSEYLTVK 
VISLHLVFQIGVIFLFQVPLTDPRDVIA 
AGRTYGRI 


4397 


18298 


A 


4426 


2 


241 


HHVWLI CFVFFVCVETGFCHVAQVGLER 
LGSSDLPTSVSLSAGITDVNHWAQSKPS 
P SKGTGMNRP IGLRVGKG VWSRL, 


4398 


18299 


A 


4427 


2 


438 


TRLIGPNCPGVMNPGKCKMGIMPGHIHK 
KGRIGIVSRSGTLTYEAVHQTTQVGLGQ 
SLCVGIGGDPFNGTDFIDCLEIFLNDSA 
TEG 1 1 L I AE IGGNAQENAAE FLKQHNS S 
PMS KP VES F I AALTS S TGRKMGHAGQLL 
AEEKV 


4399 


18300 


A 


4428 


34 


427 


AGMLPAVGSADEEEDPAEEDCPELVPME 
TTHSEEEEKSGLGAKIPVTIITGYLSAG 
KTTLLNYILTEQHSKRVAVILISrEFGEGS 
AIiEKSIiAVSPGGELYEERLEIiINACLCC 
S VKGNGL I AI ENLMRQKGE 


4400 


18301 


A 


4429 


330 


8 


YSDRISHWIDMCPSKYIFRGPVHPPHIS 
TFLHTAHNLVARGNYENMS PHDEEKGAY 
QRKEKCCDDPMAGAARIHFLCFFEMGFR 
SVSQAGVPWPHITSMQPQPPRIK 


4401 


18302 


A 


4430 


446 


222 


LTSSDPPASAPQSAGITGVSHRPQPIWG 
VS I Q L Y F S GCP DTP GQ GRLTREHCDGQD 
GEQGGPCSESVAQFCHDWS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E>=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 


4402 


18303 


A 


4431 


3 


243 


AREDSADFVSFFPDFVWTLRDFSLDLEA 
DGQPLTPGTSQKD3NINFILPRLC IRKFTS 
LIRTSRSYQSPDTVSVASSRGAR 


4403 


18304 


A 


4432 


433 


0 


PPPSPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPSPSSPSSSS 


4404 


18305 


A 


4433 


25 


185 


ARWVAC S E LRLRC C T P AWAT E P E P VFKK 
KKKVQQQGGWPGGLTPVSYRFGRLT 


4405 


18306 


A 


4434 


2 


329 


VRD I KEKLCYVAVDFEQEMATVE FSSSL 
EKSYELPDGQVITIGNERFRCPEAGIQP 
S FLGMES CGMHETTLNS IMKCDVDIRKD 
LYANTVLTGGTTMYPGIADRMH3STEI 


4406 


18307 


A 


4435 


2 


374 


WVTFISLLFLFSSADSRGGFRRDAHTSE 
VAHRLKDLPEENFKALLLIAFAHYLHQC 
P FEDH VELLNE VTE FAKTC VADE SAENC 
DQSLHTLFGDTLCTVATLRETYGDMADC 
CAQQEPERNECF 


4407 


18308 


A 


4436 


2 


352 


KVRRIDSRLAELDREIRNPQISCQLIYS 
PSLFLGAFFLFFFFGFFFPPPYIiLAPKN 
QFNPPPLCTGGPVPRPKPPIPPWEPPRP 
APPLFSPQWDLGFQIAPFLWVINGGGPS 
GGSRA 


4408 


18309 


A 


4437 


2 


369 


QSCETLFHSWKDVEVCSSALSCLSQVSV 
HLQGLESLFILPGMEVEQRDSQMALVES 
LEYVRGEISKAMADFTTWKTHLLTSDSQ 
GGNQMLDEGFREDFSEQME IAIRAILCA 
IQNLEERKNE 


4409 


18310 


A 


4438 


2 


231 


SDTSRPQLPCPAARERFPDGPLSLRPLP 
FFLSQVYNEQIHDLLEPKGPLAIREDPD 
KGVWQGLS FHQVWDWARVG 


4410 


1S311 


A 


4439 


1 


365 


MAHAMEEVKKCLGPDMMDDI CHEQFLEL 
S Y LNGVP E P SRGRGVP VRGRRAAP P P P P 
VPRGRGVG P PRWALVRGTP VRGAI TRGA 
TVTRGVP P P PTVRGAPAPRALTAVMQR I 
PLSPPPATK 


4411 


18312 


A 


4440 


92 


510 


IiAALPDGGQELHIPRSRAGPAGELSWAV 
VGTGALSSLSILFQMARCRPWPPLRPNS 
PIiQGWIFNWPWLVAIPASLFSGFLSDH 
L INQGYRAI TVRKLMQGMGLGLS S VFAL 
CLGHTS S FCESWFAS AS IGLQTFNHR 


4412 


18313 


A 


4441 


2 


266 


FPPHLPELGKWECPWHQCDECSSAAVSF 
CEFSPHSF CKDHE KGALVPS ALEGRLCC 
S EHDPMAP VS PEYWS KT KCKWES QDHGE 
EVKE 


4413 


18314 


A 


4442 


2 


279 


KGAELVFLPSNTRSLMYPLDQGWRTFK 
AHYAGYSMERIVSAMEEMPGREKIMKKL 
LKLSS PKE I PAGEKLCPNDAHDFTELDD 
RVIHENHE 


4414 


18315 


A 


4443 


2 


371 


KWVTFISLLiFLFSATDSRGVSRLDAHTS 
EVAHRFKDLGEENFKALEAIAFAQYLQP 
WPFEHHVELGNDVTES AKTCGADE S ADN 
CDKSLHTIiFGDTLCTVATLRETYGEMAD 
CCSQPEPCEKG 


4415 


18316 


A 


4444 


2 


372 


W VTF I S LL YLFS SGYS RGVFRRDAHKS E 
VAHRFKDLGQENFKALVLIAFAQYLQQG 
SFEDHVKLMNEVTEFAETCVADESSDDC 
DKSLHTLFGDTLCTVATLRETYGEWADC 
CAQQEPERNEC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=GIycine, BNHistidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4416 


18317 


A 


4445 


1 


283 


FRKNAED ILTMEGLRATMKHELEAAQKK 
HSLWELLRIPNICKRIWILSFVRSAGVC 
WRSTPDP VCLGITSGGCRTAQ I PACS FL 
WKLHSIGAPA 


4417 


18318 


A 


4446 


173 


2 


KKKKCVLYPEGHGVFHALVCVCVCVCVC 
VCVCIQVNFRNFCFISWINLSVQVSLLD 

y 


4418 


18319 


A 


4447 


104 


221 


S CHLH YS LGDKS E TLS KNHHYHHHHHNH 
HHHPAGYFKMK 


4419 


18320 


A 


4448 


241 


1 


W VS WD KAP S WVRGLG S GSRGGVWGGG S P 
PHRVTDVSCSFLPDGRHVWEMEAKTDRD 
LCKPVSAFGAAGATVGPQGLGGRC 


4420 


18321 


A 


4449 


229 


349 


WKRSHSFGLDWAHACNPSTFGGQGGRI 
MRSGVQDQPGQY 


4421 


18322 


A 


4450 


76 


189 


TESKPAATTRSSGGGGGGGGKRGGKKDD 
SHWWSRFQKV 


4422 


18323 


A 


4451 


40 


359 


SGEPLFPFFFSLARLIPGVCFVEMHTRV 
RLLIGLKIWEKKVPQVSTPTLVEVSRNL 
RKVGS KCCKHPEAKRMPCAEHYLS WLN 
QLCVLHEKTPVSDRVTKCCTES 


4423 


18324 


A 


4452 


2 


371 


KFQNALLVRYTKKVPQGSTPTIVKVSRN 
LGKVG I KC C KHS E AKGMP CAE D YL S WL 
NQVCVLHEKTPASDRVTKCCTESLVNRR 
P CFS ALEDYET YVP KE CNAETFTFHAD I 
CTLYENERLFQ 


4424 


18325 


A 


4453 


118 


369 


GRYILLLKTKKQTANNNIKTPQYLSNMS 
KKFRHSEFFFFYLKWSLALQRCNLGSLQ 
PTPPRFNQLSCLSLPNSWDYRCVPPNLV 


4425 


18326 


A 


4454 


364 


3 


GHLSLQRLLLPFVWLCPAPRGGAYRGRQ 
AS LS CGGLHP VRAS RLL CL PNQACTMAG 
AP PPAS LLP CSL I SD CCASNQRDS VGVG 
P S E P CAG YNLLVCRFL S P SE KR S P S AG V 
MRFSRCRL 


4426 


18327 


A 


4455 


1 


379 


AAFTE CCQAADRAACLLPKLDE LRDEGK 
ASSARQRLKCASLQKFGERAFKAWAVAR 
LS QRFP KAE FAEVAKLVTDLTKVHTECC 
HGDLLECADDRSDLAKYICENQDS I SSK 
LKECCEKPLLEKSH 


4427 


18328 


A 


4456 


1 


362 


LRPARSLVFPWFAPGGSGLRGLKLLEAK 
CQGDGVSYEETTIPRPSAYHNLFGLPLI 
SRRDAEWLTSRELDSLALNQSTGLPTL 
TLPRGTTCLPPALLPYLEQFRRIVFWLG 
DDLRSWEA 


4428 


18329 


A 


4457 


250 


330 


EWWTHLWLNEGFASWIEYVCVDHCFPE 


4429 


18330 


A 


4459 


96 


319 


YLTIPFSQAMENCKDARLTTSIGAYNFN 
HRLLEMILNKPGLTYKPVC3STQVSALSLL 
S FLFFMPLFLS YCQVS XH 


4430 


18331 


A 


4460 


50 


339 


NSQTLKLSNVQCLKTRALFFFPKGI YPQ 
IKTLPPCPTSGPDPYVRVNLLPKGKGPC 
LKKISGNRKTLKPLFDKTLKFLVPMEKV 
PKSPLNVPVKNI 


4431 


18332 


A 


4461 


2 


357 


G YDGVKRWTRDGD I FNKELLL I P I HLE V 
HWS LI S VDVRRRT I T YFDS QRTLNRRCP 
KH I AKYLQAEAVKKDRLDFHQGWKGYFK 
MYCKHLALSQPFSFTQQDMPKLRRQIYK 
ELCHCKF 


4432 


18333 


A 


4462 


182 


325 


IRLSSWLCDCASCQVYEFRVKESS I IAP 
APAEDVDTPPRKKKRKHRL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=JLysine, L-JLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \— possible 
nucleotide insertion 


4433 


18334 


A 


4463 


1 


338 


SRHS PGPTPQPD CSLRTGQRS VQV3DTS 
SCSQLSSSSGSSSSSVAPAAGTWVLQAS 
QCSLTKACRQPP I VFLPKLVYDMVESTD 
SSGLPKAASLLPSPSVMWASSFRPLLSK 


4434 


18335 


A 


4464 


2 


343 


I IDFFLGAYLKDEVLKIMPVQKQTRAGQ 
R TR FKAFVAI GD YNGH VGLG VKCS KE VA 
TAIRGAI ILAKLS I VPVRRGYWGNKIGK 
PHTVPCKVTGRCGSVLVRLIPAPRGTGI 
VS 


4435 


18336 


A 


4465 


1 


371 


INSTLKMSFVGENSGVKMGSEDWEKDEP 
QCCLEEIFFALASSLSSASAEKGSPILIi 
GVSKGEFCLYCDKDKGQSHPSLQLKKEK 
LMKLAAQKE SARRP F I FYRAQ VGSWNML 
ESAAHPGWFIC 


4436 


18337 


A 


4466 


246 


12 


LCFTPLPSYILFFFETKSHSASPKLECS 
GVI TATSE VI LLGGGGCS ELRS CYCTPT 
W VTKLDS I TKKRRKRKRKDL Y 


4437 


18338 


A 


4467 


2 


355 


WVTFISLLFLFSSAYSRGVFRRDAHTSE 
VAHRFKDLGEENFKALAL I AFAQYLQQC 
PFEDHVKLANEVTEFAKTCVADESAENC 
DKSLHTLFGDKLCTVATLRETYGEMADC 
CAKQEP 


4438 


18339 


A 


4468 


2 


361 


EDHVKLVNEVTE FGKTCVADE S AENCD K 
SLHTLFGDKLCSVATIRETYGEMADCCA 
KHEPERNECFLQHKDDNPNLPRLVRPEV 
DEMCTAFHDTDETFLKKYLYE I ARRHPY 
FYAPELLL 


4439 


18340 


A 


4469 


1 


373 


SSQPSSPSESVSGTVSVSPSSLPSSPCIj 
SLSVCLLVCVSLSMFFSLPVCLSVPLCV 
SPRPHPSVSHAHPHRASALVNCPWGPAP 
TS AP VS F YKPQS PALKTGQAS PHPHPHP 
FPTPSPMQSQKK 


4440 


18341 


A 


4470 


236 


371 


KVLGAVLKDKGGLFCFVFGDRVSLCHPG 
WSAWQ S QLTAAVTSQA 


4441 


18342 


A 


4471 


345 


201 


AWMTAHLFMAWFTEYFKSTAQKKKFVFK 
TLLLIDNFDPLWDSKEGKL 


4442 


18343 


A 


4472 


1 


144 


agfhhvgeaglglltsrdppaspckrag 
itglnhhtwpfsviltytyy 


4443 


18344 


A 


4473 


3 


333 


KNHLEASIGEHPFFPDHSPSCPHLLSSS 
NYHSLSCFLGGFFSETGSAPSPRLECSG 
V I AAHCS LKLLG S T S P LT S AS WVAG I TG 
MCWLYS PAVS PRPS PDLTCTSQDENI 


4444 


18345 


A 


4474 


288 


1 


TSAVNR I S I KSQADQEATEQKLRD YSHQ 
GIQILAWQKFEYKPGKVTMDPEDCILQL 
.ftJEClXKE EEEEEEEEEEEEE1EEEE EE E E ET) 
QEEEEEEGEEEE 


4445 


18346 


A 


4475 


3 


124 


PRLECSGMIMAHCSLDLLDSSDPPTLAS 
QVAGTTKKISLH 


4446 


18347 


A 


4476 


196 


2 


SCYEHPGRIVYGFLFLRQDLAVTQAGVP 
WHDHGSLQPLPPELKRFSGTSHLLSSQG 
YRRAQPRR 


4447 


18348 


A 


4477 


3 


162 


LLERNKELEGSLQQMYSTNEEQVQEIEV 
RALHVLAHTPRGPLGLCVPLCNLE 


4448 


18349 


A 


4478 


167 


2 


ACGISPFSLSSSCFGHVRHASLPLAFHH 
DCG F P E AS P AML P VQPS QLNL F I KYP V 


4449 


18350 


A 


4479 


195 


26 


WVSMLDIHDNGPHEVAVICFVCFIiSQSL 
ALLPRLECSGVISAHCNLQPGRQSQTPS 


4450 


18351 


A 


4480 


175 


1 


NRSATKCDWGEKRTSQELNLFFWFFVCL 



825 



WO 01/64835 



PCT7US0 1/04927 



SEQ II> 
NO: of 
nucleotide 
sequence 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Afanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














FEMESHSVAQAGVQWRNPGSLKPLPPRF 
KR 


4451 


18352 


A 


4481 


122 


357 


YHAFL YQWYNLKREHKRFRCS LLI I FLG 
GE RKLQ S I QL WN I SHY I Q I RR Y VLE IPS 
GLGAVAHACNPSTLGGQGEWIT 


4452 


18353 


A 


4482 


137 


2 


FYSYLKILEEMGPHYVPQAGLELLGSRD 
PPDS AYQSTE I IGMSHH 


4453 


18354 


A 


4483 


347 


208 


GQAGLEDLTS SLPASASQSAG I TGAGHH 
TRPRIYFLKKYLAFVLGS 


4454 


18355 


A 


4484 


411 


268 


RLNRRGGGCSEPRSCRCTPAWATEQNS I 
SKKNQQKTTKLPLCCRLILL 


4455 


18356 


A 


4485 


2 


102 


PCLLKMQQLAGHGGMHLCSQLLRRLRCF 
IiHSYP 


4456 


18357 


A 


4486 


1 


113 


HS AHS THAGHAGH TS LP KC WD YRS E P PR 
LAKFLNTRY 


4457 


18358 


A 


4487 


163 


353 


QVAATLNNLAVLYGKRGKYKEAEPLCKR 
ALE IREKVQKKGAWFFE I FVLCVCGFF 
FFFKRDP 


4458 


18359 


A 


4488 


408 


307 


RGCSEPRSHDCSAAWLTERDSVSKKNKN 
KSPKKQ 


4459 


18360 


A 


4489 


337 


179 


YSINFALILIFLFYLGVLVWAAIINYHR 
LHGLNNKHLLIMIWMVGSPRLSCG 


4460 


18361 


A 


4490 


229 


345 


LENETLGRARWLTPVIPALWEAEASRSR 
GQEIETILANM 


4461 


18362 


A 


4491 


180 


365 


ENNLKTLVALDMSFYFILFLFYFILFYF 
ILF YFLRWS FALVAQAGVQWRYLGS LQ P 
LPPRFK 


4462 


18363 


A 


4492 


269 


385 


GNLQNLEKVFYKIQHPFMIKTLNKLVID 
RTYLNIIKAIY 


4463 


18364 . 


A 


4493 


103 


336 


QLLVS L INLFLQAFKLQ I DCGHTCMHFW 
RQNLCLLPPTLPTPRRNLTLSPSLECSG 
TIWAHRKLRLPGSHHSPASATR 


4464 


18365 


A 


4494 


305 


379 


YIKIENFCLGWAHTCNPSTLGGRG 


4465 


18366 


A 


4495 


47 


224 


ETATKTQRRWETALVPQHPHRPLAPVTW 
VKTTGNSTI ANFLL ILVYLS FLFLVYS C 
IQQ 


4466 


18367 


A 


4496 


3 


129 


ITLNPGGRGCSELRSCHCTPAWATERDS 
VSKKKKKFPPPPF 


4467 


18368 


A 


4497 


396 


122 


LPELEFEAAVSHDCASELQLGHQSKTLF 
QKKINRI1NNNNNSHIAISLLGIYPKEF 
KAGTQRDICTPMITASLFTIVKRQKQPK 
CLATNELVYRM 


4468 


18369 


A 


4498 


1 


152 


LSLPGSQTWWRALIVPATWEAEVGGSPE 
PRRSRLQVAMITPLHSSCGRRG 


4469 


18370 


A 


4499 


92 


2 


KRP G WTHACN P S TLGGRGGP I T KSWR 
DQ 


4470 


18371 


A 


4500 


116 


1 


AGIAGMCLHAQLI FLYFLVETRFCHLGQ 
AGLELLTLWS 


4471 


18372 


A 


4501 


99 


1 


QGTPVHRWWKCKLVQPLWRTVWS FLRKV 
KIDNS 


4472 


18373 


A 


4502 


140 


288 


RQFHYVAQAGLELLGFSNLPALTSQSAG 
ITGASHHNQPTSRFLKEKKIG 


4473 


18374 


A 


4503 


330 


181 


FFFFLFFFFFFFFFFFFXXFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFF 


4474 


18375 


A 


4504 


171 


2 


VETVS KESSLRI CLQDIR YFFFETKPHS 
IAQAAVQWHYQGSLQPQPLRLKQSSRLS 


4475 


18376 


A 


4505 


345 


74 


SVERTCHSPKPLMLFLPLVFQARDDILN 



826 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 
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Predict- 
ed end 
nucle- 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine 7 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GSHPVSFEKACEFGGFQAQIQFGPHVEH 
KHKP G F LE ACHP YQ TAE PAS ALT LD F PA 
SRTVQK 


4476 


18377 


A 


4506 


268 


102 


KNIRELWPGAVAHACNPSTPGGRGRWIT 
RSGDWLNLRTCEHVILQQKGKFRFLIS 


4477 


18378 


A 


4507 


1 


67 


GLPKCWDYRREPPHPARNCKYF 


4478 


18379 


A 


4508 


339 


180 


ANFFVFFEEMGFHRVAQTGLEVLGSSDL 
PILASKSAGITGISYSVRPQTNTE 


4479 


18380 


A 


4509 


174 


3 


NEPKWIYI I PGDPLFFLLLLFFETESH 
YVTQAGVQWHDLGS LQPLP PRLQRFS C V 


4480 


18381 


A 


4510 


84 


3 


AHHI FTVGIDVDTRAYFTSATI I IAI 


4481 


18382 


A 


4511 


127 


2 


LLIKIHCWPGWAHACNPSTLGVRGRWI 
TRSGVQDQPEQYG 


4482 


18383 


A 


4512 


109 


345 


EIFLIQGHEGSCLCFLFGSFMCFVATGF 
PHVTWL I LTLSNLPTS SYS RAG I TGVSH 
HARANNLFCFGSMPSHGITGLNF 


4483 


18384 


A 


4513 


290 


13 


RDLRPTPQKGPKNHTAPNPLQRAQTHVK 
KGRGPTPPTPTPSKDPGPHPPLRKPKPW 
GRVWGE KKKKNKNGP KDKKGKKKKKKKE 
RKIQHSRF 


4484 


18385 


A 


4514 


234 


362 


DGDVLS KFVNFFS FKWEGTPL I DGRRL 
KYRLNGDFLFLRLKL 


4485 


18386 


A 


4515 


190 


330 


CLALQSECFRRWSFALIAQAGVQWRDLS 
SPQPLPPVFKRFSSLHPFE 


4486 


18387 


A 


4516 


2 


199 


.CVCVCVCVCVCVCI FFFFLSKKWGPPPL 
G P PGG P KTLGP KE S P P LAP KNRGNS GL T 
PPPPPTFFF 


4487 


18388 


A 


4517 


94 


339 


SAYLGLKFFLFFFFFFFFFFFGVFPPFF 
FPNKKKLKPPQIFFPPQKRPPIFFFPPP 
PGGKNFFFFGKFFFNPREKGPPKNLG 


4488 


18389 


A 


4518 


95 


3 


L YLGPGAVAHACNP S TLGGRGGQ I TRSG 
VQ 


4489 


18390 


A 


4519 


14 


121 


LIFFGRNRISLCCPVWSQTPGLKPSSWL 
SLPKCSD 


4490 


18391 


A 


4520 


207 


2 


ISREKLCTVRIPDMHVIEDSILFYYLFY 
SLALPPRLECNGTIMAHCSLHLPGSSNP 
PASAS Q VAGTTS 


4491 


18392 


A 


4521 


169 


1 


ANLTQVGSYYSLSLHTLIQVTFFKNIRA 
VRPGT VAHACNPRTLGGPDGWI TRSGDR 


4492 


18393 


A 


4522 


2 


82 


GFILCVCVCVCVCVCVCVCVLYFIFF 


4493 


18394 


A 


4523 


300 


3 


LFLPPSPLPSQXPGGQKPLPLIFPPRVP 
KKFIPPNFWAPPLISPRPFWVRFLGPPT 
QGFNYPLGPGPPPPPPFSLGFKKSPNPF 
FIKKKFSPPPKKQK 


4494 


18395 


A 


4524 


348 


94 


KPGEGGKGPQNPPPGGLKPGNPLNPGGG 
GSPEPNFPPWPPPRGAKKKTRPQKKKKK 
KKGRKESRLNYGMEKRMKEKEKVFLSAL 
N 


4495 


18396 


A 


4525 


313 


161 


RGHPKNLKS KEVNIRREAPGHPRRNKKK 
TPNLKKKKKKKKKENKKQKQKP 


4496 


18397 


A 


4526 


118 


3 


FF FF LRG S FALVAQAWQ WRYL S S LQAL 
PPGFKRFSCP 


4497 


18398 


A 


4527 


153 


2 


KD AINKMKKW PG WAHACNP S TLGGQGG 
WITRSGAQDQPSHYGETPSLPK 


4498 


18399 


A 


4528 


258 


2 


GGQKRGLPGFIKKPPFRKIFLKNLGEGP 
GKNSLSKFFFFFFFETESHCLTQAEVQW 
LNLSSQQNKPPRFKQFLCLNHPSSWDYR 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=L.ysine, L-Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va1ine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














R 


4499 


18400 


A 


4529 


296 


39 


SIRIINIQILLGFFETGCHSVAQAHLKL 
LGSND P PAS AS E S TE I TEMQH CTRLS I Q 
I F VAF P YSNI KLS TKKEP I L I KLLL YNC 
K 


4500 


18401 


A 


4530 


292 


37 


SIRI INIQILLGFCETGCHSVAQAHLKL 
LGSND P PAS AS E S T E I TEMQ Q C TRLS I L 
I FVAF P YSNNKLS TKKES IL IKLLLYNR 
K 


4501 


18402 


A 


4531 


3 


138 


GLWNGS CLCVCVC I FMGMGACLVCICTC 
LYCCVPVNTCLCMDGR 


4502 


18403 


A 


4532 


380 


2 


GGP P KKRE S AKAP GGRNPQGGGRE KT P P 
P KKKR P P Q KGGG KKT P PGGGGG P P RGGG 
KKKP P PKREKE CHRGGRGGGGP P WTP K 
EGPPFFFFFFFFFFFFSLPFFFFFCFFI 
FILFFPNIVINGFY 


4503 


18404 


A 


4533 


3 


191 


LSKITHAS I PVPVSKKKKKKKKEKKKKK 
KKKKKKKKKKKKKKKGGGGFKKTLGGHN 
FSRGLE 


4504 


18405 


A 


4534 


370 


48 


GPAPEVSSNGGKGGGCPKGLGKPLGSQF 
NGVKGYPSPKSSPLFQAPLGGQGFPKPR 
GNGGT P P F F P KKKKKKC ALGDGT P TSGC 
PRRSWNHPETQDGCGGWGSRVPR 


4505 


18406 


A 


4535 


147 


1 


KGPPLFFFFFFFFFFFFFFFFFFFFFFF 
FFFFSFYFYI IYLFFLSFLRL 


4506 


18407 


A 


4536 


198 


I 


NPPYFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFLSFIFIFLYFNFI 
SSNIVFIGFY 


4507 


18408 


A 


4537 


2 


356 


QNRTTGNSKTQSASPPPKECSASPATEQ 
SWMENDFDELREEGFTRSNYSELREDIQ 
T KGKE VENFE KHLHE C I TT I TNTE KCLQ 
ELTELKTTARELRDECRSLTSRRDQLAE 
RVSALE 


4508 


18409 


A 


4538 


347 


220 


KCSGTI IAQCNIKLLGSSNPPVSASQSA 
GITGMSHHVQPKYC 


4509 


18410 


A 


4539 


308 


2 


FS ISCPHDTPAAASQSAGITGVSHCARP 
IFSFICFFFICIMCIKYLLRTGDSVMYF 
HGLGCFGLCCYLESYIRKLSHMKKLSIL 
ITSREKCKCILQINIGDR 


4510 


18411 


A 


4540 


2 


194 


I KQWS TALS PRLECSGT I TAHCS LKLP 
G S AS AS WARTTGT CHHAQL IYFSETGS 
HYVDVDAA 


4511 


18412 


A 


4541 


101 


2 


MEENLGNTIQDIGRGKDFTSKTPKAMAT 
KAKID 


4512 


18413 


A 


4542 


1 


84 


AEEAEAEAEAEEEEEEEEEEEEEEE IKY 


4513 


18414 


A 


4543 


3 


183 


KIQILFFVSSKTRLRPSAWHPRFVFCFL 
RWSLALAAQAGVQWHHLSSLQPIiLPGFK 
VDAA 


4514 


18415 


A 


4544 


1 


335 


RKKKKKEKKKEKKKKKQGRKKRKRERKE 
EEAKKKTKNKDK 


4515 


18416 


A 


4545 


104 


2 


QKKKKKKKKKKKKKKKKKKI YDFFLQ IN 
CMCSHK 


4516 


18417 


A 


4546 


5 


108 


FKRLSCRSFPSSWDYRREPPRPANFCIL 
SRDGIS 


4517 


18418 


A 


4547 


131 


1 


RPRRPQVGLQDATSPIIEELITFHDHAL 
IIIFLICFLVLYALF 


4518 


18419 


A 


4548 


156 


3 


KGGQGMSCLSRSPSALLKFECRVISSYW 
AWWRAP WPATRGAEAGES LEP 
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nucle- 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P-Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


4519 


18420 


A 


4549 


2 


340 


LNPGSAGCRELRSHHCTPAWTTERGSVS 
KKKKKKKKKKKKKKKKKKKTFFGGGPLW 
EREFLLGNFWEKKGGGANFFPLENPNGP 
QLKTWGKKAFFPRKGGGPLNPPQGFYQG 
A 


4520 


18421 


A 


4550 


165 


335 


PHYLPRRSI IHRPFKSHSPEPGAVAHAC 
NPSTLGGRGGWITSGQEFETSLINMEKS 
L 


4521 


18422 


A 


4551 


123 


3 


GRVDRLSCLSLLNSWDYGHLPLiHLANFC 
IFSRDGVLSSW 


4522 


18423 


A 


4552 


3 


131 


HPEVDIPILPGLGIISDIGAYYSGISEP 
FGYVGMDVSYDPVRL 


4523 


18424 


A 


4553 


87 


364 


ILASQSWFRYAASAASGGLLETESQPS 
AAGTASAAVPSLEAQTPAAFRPTPPGLA 
PSSPPAPGPPAPTGKVCGRHFL>RSESSA 
SGPLRALL 


4524 


18425 


A 


4554 


114 


0 


LTWAPHSLIETIRNQIIQALLITILLGL 
YFTLLQRSE 


4525 


18426 


A 


4555 


91 


3 


TTTTYIAIATACVCVCVCACVCPCVCVC 
M 


4526 


18427 


A 


4556 


239 


364 


CIKITFSRPGAWHACNPSTLGGQGGRI 
AMS RDRDHPGQQGK 


4527 


18428 


A 


4557 


250 


1 


STEHVKQLRNEFMKISVILIFLEILLYI 
KVHLKLCTLTTDHIiLLGIKSLiKILFRPG 
AVAHACNPSTLGGRGGRIPRSGDRDHS 


4528 


18429 


A 


4558 


2 


90 


QDGLDLLTWGSACLiSLPKCWDYRSEPSL 
F 


4529 


18430 


A 


4559 


65 


394 


DPVSKKKKKKKKIKNEKEKKFFGKKKPL 
KKKLRADLILKPKKKNFLKKREWKKREK 
GGDPFWKNKPQVGKTGNNLNLGQNGLGK 
GPSDLKKNTPSTKLNIiVQKKFQIFF 


4530 


18431 


A 


4560 


93 


2 


G 1 1 S AH CNLCL P G S E DS PAS AS Q VAG I T 
GQ 


4531 


18432 


A 


4561 


125 


4 


GAYILYLNKKSKS IRPGMVAHACNLSTL 
GGRGGR I TRSGD 


4532 


18433 


A 


4562 


3 


84 


RSCHCTPAWATERETLSPKKKKKKNF 


4533 


18434 


A 


4563 


145 


3 


GFYHWPGGVEQSPCLSLPKYWNYHCEP 
PPLASIGFFFFFFETQSRS 


4534 


18435 


A 


4564 


125 


388 


I RKPQ ILYSPQSS ENMQHLH I SVLMGVT 
GKRGKFMD I LDVTE RCR F S L KKCTG I Q E 
DLNKWEGIPCLWI I RLNT VKMAVF S KL I 
CRFS 


4535 


18436 


A 


4565 


59 


325 


VARKGNLVS FL V I FY PLFDIEEVIEP LW 
VLISSFDEGEKVELEFFFFFFFKKKSPF 
VTQAEFQWPNLSSLKPQPPRLKQFFCLG 
LLIT 


4536 


18437 


A 


4566 


326 


3 


PEKEFRRLVIKLIREAPEKGEAQCKEIQ 
KS I QE VKGE I FKE I DRIKKKQLKHQETL 
DILLVMQNALESLSNRIEQVEERNSEKE 
DKVFDLTQSNKDKAKRIRKYEQS 


4537 


18438 


A 


4567 


333 


209 


EQAFDP YGTLSQG I FS PKFHE P FHKLMA 
RPAVALP FQ VNLK 


4538 


18439 


A 


4568 


125 


2 


HNQSMSRSIDCSDPILGHCNLCLPGSSD 
S PASAS QVAGTTG 


4539 


18440 


A 


4569 


212 


2 


P I CLS GKKC YGQNVRVI KCHMWKHNHRL 
GVWHACNPNSLVGQGGRFMRSGVLEQH 
EQHAETPSLLKTLK 


4540 


18441 


A 


4570 


142 


14 


GRVDRSNPGRFLSTSNSSLYERPREIRP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=PhenylaJanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TSQAPSPVNHIIST 


4541 


18442 


A 


4571 


2 


419 


ARDS FLHDSQTS FCFSDS I PTPSNMEET 
QQKSNLELLRISLLLIESRLEPVRYLRS 
MFANNLVHDTSDSDDYHLLKDLEEG I QT 
LMGRLEDGS RRTGQ I L KQT YS KCDTNS H 
NHDALLTNYGLLYCFRKDMDKVETFLR 


4542 


18443 


A 


4572 


3 


194 


HEGRLQSQLLRRLRQKNCLNTGGGGCSE 
PRSHHS FPAWVTEHGF I S Q I S LKPLF I S 
HPEPFLL 


4543 


18444 


A 


4573 


2 


286 


CRDGLTYNDFLILPGYIDFTADHVDLTS 
ALTKRITLKTPLVYSPMDTVTEAVMAIA 
MALTGG I GF I HHNCTPE FHANEVRKVNK 
YEQGFXTDPW 


4544 


18445 


A 


4574 


44 


325 


RQWLQEEAYITKEQKYSFLHDSQTSLCF 
SDSIPTSSNMEETQQKSNLELLHISLLL 
IESRLEPVRFLRSTFTNNLVYDTSDSDD 
YHLLQDLEEG 


4545 


18446 


A 


4575 


374 


33 


LRLARRSGASLCGSRSGGPPTCANICPR 
YAP LI P P CRTS LLE AQGL VMWGPD FRV 
RLVTSSERRFRYQPPPPGFNRFPCLSLP 
SNWD YRHAP P RLANFNS L F F F F FL VE TG 

Y 


4546 


18447 


A 


4576 


400 


225 


P I F FWFFL F VL FL VETRL CHVVQAGLE 
LLGS SNLP I SAS QS AKITGMSHPAWPRA 
EF ( 


4547 


18448 


A 


4577 


2 


372 


ARET I FCFSNS I P AP WNMDETRQKSNLQ 
LLRISLLILLYRTRLTLPCITQLHILQT 
YKVTTARCSHFHS S S TCGPLYS CTLYVSL 
TGLDMKLCHSTLLKQNSALMSLLKIQW 
LNQVS P VFNAH 


4548 


18449 


A 


4578 


349 


174 


GCGEPRSHHCTPAWATRAKRCLKKKLKL 
QLKNWKESKHQPLQNFSHLTTSTINTQ 
EN 


4549 


18450 


A 


4579 


3 


124 


HELLNPEGEGCSEPRSCHCTLAWVTRAR 
LRLKKKKKKKKN 


4550 


18451 


A 


4580 


181 


1 


MSSYYFSPCSRLLMATVYNLHLPSSSNS 
PASASQVAGNAGARHYAWLI FVLLVETG 
LCSC 


4551 


18452 


A 


4581 


227 


3 


HRKRNRCVSRTALLFSLGDGVSPCLKI I 
I I IKVLFWLGE VAHACSPSPLGSRGGR I 
TRSGVRDQPGQHGETPSC 


4552 


18453 


A 


4582 


257 


2 


RVRQLGAFSLS PGSGS FSKI LGALKIMG 
P PRAGPVGS PRYSRPLGGPGRQ I PRSGD 
PGPPGLPRGNPVPLKKNPKRSLLILPAR 
A 


4553 


18454 


A 


4583 


3 


94 


HE I AMAFL I LTE RK I LG Y I QLRKG PNW 
GP 


4554 


18455 


A 


4584 


345 


50 


IYIYICI FLNRNRVSLHCPGWS KTPGLK 
QSSHLGLPKCWDYSHEPLCPAKTSHFKA 
QDRIP KND P S CT P S PAQMS TNL I G W AS S 
YLQSYS PAPYNNKN 


4555 


18456 


A 


4585 


338 


19 


FQLSRRFRPKNGGNPGSGACSEPRLGPC 
PSAWTKRDSVSKKKKEYHTLSGLSTTG 
I YFSQFWMLGVQGASRFGVWGGAVSWS I 
C GAVLLRPHMAE GARQLS GLGF 


4556 


18457 


A 


4586 


2 


224 


ARGVGRAGLE LQ T S GD P PAS AS RGAGI T 
GVSHRVQLLGDRAKFSLKKKKRGI ILPG 
EFYSPLKKRFTLNTPDNP 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


4557 


18458 


A 


4587 


344 


168 


P PGT WHT YNP S TLGGQAGR I GS LG VQD 
KPGQHRQGLSLGVEDKPGQHSETPSVQS 
3STP i 


4558 


18459 


A 


4588 


212 


328 


GLAVAQAGLELLS S SNPP AS AS Q SAG I T 
GVSHHTQPYF 


4559 


18460 


A 


4589 


110 


323 


LSNQKLTQLYNPNLLIYCWRKLKSSYDK 
QAEVQGSKLGSLQALPHGFTPLSGLNLP 
S S WE YTRPPPRPTNF 


4560 


18461 


A 


4590 


2 


127 


ARGCSEPRSRHCTPAWATEQVSKKKKKG 
REKTFGQFFFWVI 


4561 


18462 


A 


4591 


211 


2 


ARG S LE AGS ST P AWAT I NQ S TNKKTD KT 
ALYWKKMPSRTYKARQKSMPGFKSSKDR 
LTLFLGAKVSCLV 


4562 


18463 


A 


4592 


3 


120 


HEKTKTNNP I KKWAKDMNRQLS KEDNDN 
KINGWPGAVF 


4563 


18464 


A 


4593 


388 


227 


GHVGQDGLEKLTSDAPPSLASQSAGITG 
VSHRAQPKWLLFQHRSPCLPEAAGK 


4564 


18465 


A 


4594 


1 


348 


PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PLPPPPPPPPPSSPSSSPPNPPATTSVA 
C 


4565 


18466 


A 


4595 


2 


138 


ARAARGFRHVGQAGLELLTSGD P PAS AS 
RSAGITGVSHHTQPTNF 


4566 


18467 


A 


4596 


315 


2 


ALKPNYSSTSMNIKARLPQYNINNLNLA 
PLYSSTSNQINIIVIILINKENNHTLFP 
CLVSNSRLKQSSHLGLPKCWDYRRELLC 
liACDKFFISYSAPAlQLLSC 


4567 


18468 


A 


4597 


1 


146 


GTSVDQGSLELLTSGDPPASASQSAEIT 
GVSHRTQPPIiTI IQLYLGYC 


4568 


18469 


A 


4598 


293 


3 


GGTPWPRKKLGQRKFPHRPPGGSGNPDF 
SliLFFFFLRQSFALVAQAGVQWCDLGSL 
QPPPPWPFKLRKQFGFSTSCFPILPAS 
GS FFGRE AS AS C 


4569 


18470 


A 


4599 


174 


2 


RSVAPFLKSARPQTYRKEPTPDTGSPQP 
P P PRFKRFS CLiTRLS S WDYRHAP PHPAS 
C 


4570 


18471 


A 


4600 


59 


252 


LAPFGHELGTSESAHQTDNRHRLEGYQK 
RIiDASGLERASYPIiAAEFKVGGRGCSSQ 
PLTLCGYP 


4571 


18472 


A 


4601 


233 


366 


I HYLPDAVAHACNP S TLRGQGGR I TRS R 
DRDYPGQHGETPFLLK 


4572 


18473 


A 


4602 


182 


325 


LFVFCLFLRQCXSVAQAGVQWHDHCSLY 
PQPPSLKQSSRLGLPSSWDY 


4573 


18474 


A 


4603 


202 


2 


GGSSSGLLRPCAVLGLKQSSYLSLVRRW 
DHNSWLKRSTHLGLPKGWDYRREPPHLA 
CILKFSINLV 


4574 


18475 


A 


4604 


317 


2 


FRLGPQVFFS PPGAGFFFCGPVWGPGP F 
SRSLQLLTFGAKFFFCLSLPESFGAPLR 
GPPSGFMAHCFFFLVLLFFFFFFRDSVL 
LCCPGWSAVAIHRRDPTTARA 


4575 


18476 


A j 


4605 


233 


71 


FFVXjLVEIEDQLNEKKQEGKIRKKKRMK 
RNKQSLQKIWDYVERPNLAGHSGSRL 


4576 


18477 


A 


4606 


114 


1 


HPRPARVSSCCPNWSAMVWSRLAATSAS 
RVQAILQSRA 


4577 


18478 


A 


4607 


348 


3 


LKFQNFRGVVHGARCPSFFGGLGRGIPF 
TPEVEVLVNLGPAPALPFWPPSQTLFSQ 
NQKKNQKKKQERKREKERKEERKKKKER 
KKKEKERERKEGRKKGQEKKRKI TAKSL 
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NO: of 
nucleotide 
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od 


SEQ ID 
NO: in 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Pheny (alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine 3 M=Methionine, 
N=Asparagine, P=Proline } 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine ? 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














LG 


4578 


18479 


A 


4608 


174 


3 


TWMWQQCVFVCLLKQGLALSPRLECT 
GI I IAHCNLKLLGPSDPPTS VEKTKKLV 


4579 


18480 


A 


4609 


171 


334 


DPHSSLNIRFSRSVAQAGVQWCNLNSLQ 
PPPPGFQRFLCFSLPSSWDYRCAPP 


4580 


18481 


A 


4610 


2 


164 


ARGSRGHIWSAFCHVGQAGLELLTSRPA 
LASQSAGlTGISHRAQPPLNTIiYSMS 


4581 


18482 


A 


4611 


2 


164 


AREPRGHIWSSFCHVGQAGLELLTSRPA 
LASQSAGITGISHRAQPPLSTLDSMS 


4582 


18483 


A 


4612 


180 


334 


CCLMGSSYLLPRKPNTLRTAGFLQQRKS 
IiIRPGMWHACNPSTLGGQGGWI 


4583 


18484 


A 


4613 


189 


1 


SGFSRWENGLKCKGQKGGQSIiVLLPIR 
VF F F F F FEMKSHS FAQARVQWGKFLGS LQ 
PLPLGFK 


4584 


18485 


A 


4614 


216 


358 


KSTPRLGTVAHTCNPSTLGGQGKRITRS 
GDRDHSGLHKGTPRLSKNI 


4585 


18486 


A 


4615 


179 


393 


EKQEEEGSWGWAKTEDGVALMLSEAIGS . 
CWI KAFIiTDAETFTFHAD I CTLSEKERQ 
IKKQTALVELVKHKP 


4586 


18487 


A 


4616 


2 


126 


ARVQRYLFLFFTLDGPLPDPSMIRGSVP 
NQMMPRITPQSGK 


4587 


18488 


A 


4617 


338 


181 


RVGEAGLELIALSDLPASDSQSAGITDI 
SHHTRPGSPFLEHHLYPRYRISGK 


4588 


18489 


A 


4618 


3 


197 


HEKRP EVQNS TVE F IAS SD YMLRPPL PL 
PFCFFLFFFKIKMGVSP CCPGS CQTP SL 
KQSSCLNV 


4589 


18490 


A 


4619 


, 1 


339 


GTRTFGSGNGPTKPDLLQELRVATERGL 
VIWCTHCLQGAVTTDYAAGMAMAGAGV 
ISGFDMTSEAALAKIiSYVLGQPGLSLDV 
RKELLTKDLRGEMTPPSVEERRPSLQGN 

T 


4590 


18491 


A 


4620 


3 


306 


LEPGDGGCSELKSCHCTPAWVTERDSIS 
KKKKKKKKGGPLKGTLGGPKFNGGWQRK 
I FS QKGGQKKPNLG I LGRNLNFGGE KNW 
NKFLTKIKVLREKKNF 


4591 


18492 


A 


4621 


397 


29 


NTKE K I L S ARKNNQ VFP PHTQ KKNF F P K 
KRKNFREGIPPNFPPPKKSPPQKNHPPQ 
KKKP P KE KKKT I AR P P KKRAP P GAFKKK 
PPPFFFLSSLCISYSFIYFFSLINLFFF 
FFFFHDSTVY 


4592 


18493 


A 


4622 


2 


235 


VSLCHPGGTISTHYNFCLPGSRDPPTSA 
SAYALL I FVFFVETGFPLVAQAGPKLQG 
TNS I PAS AS QS AG I TGMNHCT 


4593 


18494 


A 


4623 


209 


324 


ILRVLWSGTVAHACNPSTLGARGRQIMR 
SGVRDQPGQY 


4594 


18495 


A 


4624 


52 


287 


NPFSFLFFSFLSFFLFFFFLEKGPRRGA 
RAQKG KPHP KKKKKKKRGGGKKNKTQ KP 
FSPLKGPRGPPRGKGQTTPKGP 


4595 


18496 


A 


4625 


102 


1 


THSFPVS ISLSLSHTHTHTHTHTHTE I Y 
RVREIR 


4596 


18497 


A 


4626 


125 


1 


DRHVPQHTANFFWL I EMGFLHVGQAGL 
KLLTSGNPLVPLV 


4597 


18498 


A 


4627 


3 


167 


ERLRQQNCTNPRSEGCSEPGS PHCTPAA 
WATEQDSVSKKEKEKRKKEKKKKIW 


4598 


18499 


A 


4628 


135 


3 


AASTLSRGTITAQGNLCLPVSSNSPASA 
SRVAG I TGARHDTWIi 


4599 


18500 


A 


4629 


320 


210 


WFLG I I FRRVENAESVNIDLTYD IQSFT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K>=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














DTGKTPSS 


4600 


18501 


A 


4630 


98 


326 


SSPTLAFQFQYFFSCATKEAVKXjSFPKS 
LLGTGSRSVAQPGEQWKNHGSLQPQPAQ 
LRLSSCLTLVSGWDHRCAPP 


4601 


18502 


A 


4631 


275 


2 


RVKLLL F YMRQ S C S V I Q VE S S S T VNGS L 
QPQPPGPRQSSHVSLLSRWDHKRMPPCL 
ANFFLFFWRAKGLAILPVHSNLVNTESP 
LISIKSN 


4602 


18503 


A 


4632 


180 


1 


KGLTLYFYFFI FKMRRGLAMLADWSQTP 
ELNQ FS CLGLP KCWDFRHE P PHP VKE FK 
PFHR 


4603 


18504 


A 


4633 


234 


5 


KPLRDKPLRSFSNIiPKKNLNFFWPFFPQ 
KIWGFFFFFFBTESHCVTQAGVQWLDLS 
SLQPPSPGLRQFLCLSLSSS 


4604 


18505 


A 


4634 


171 


2 


GRVD I LTPWQHFE TLFS FFS FF WQGLS V 
TRLECSGVIMAYCSLDLLGSGNPPTSAS 


4605 


18506 


A 


4635 


324 


211 


LRNI IFMPVTMNPGMYTGGCMCVCVCVC 
VCVCIiCVCVC 


4606 


18507 


A 


4636 


95 


1 


MRQS INLSPRLECSGTI PAHCNLCLPGS 
SNS 


4607 


18508 


A 


4637 


234 


1 


KKGHDTLTEKAMPVTLSSRMLS INVCKC 
IYKWKNSPNYSSLTHHINNIKPSFTRE 
NTLMFIHLSPILLLSLNPDIIT 


4608 


18509 


A 


4638 


2 


243 


WGKVNVDEVGGEALGRLLWYPWTHMYF 
EFFGAFYISFCLFFLIFPHLFFVSSIHI 
VSS FYLL.FYNFP 1 1 FFTLFCHFFti 


4609 


18510 


A 


4639 


118 


2 


GRVDLQCSGAISAHCKLHLQRSRHSPAS 
ASQVGAHATT 


4610 


18511 


A 


4640 


50 


168 


NKINKMHIiITFLKKKKKKKKKKKKKKKK 
KKKKKKKKKKN 


4611 


18512 


A 


4641 


315 


87 


GIFLRDRNLALLPRLECSGTI IAQCNLE 
LLSPKDSPASPSPVARTTGVQPRPDTLT 
IiFEL I Y I R YKGM I FS QGD C 


4612 


18513 


A 


4642 


383 


2 


KFKKKKFAGGKAPARLSPPLWGPKGGGP 
PRAGVLSPPGPPGETPFFLKNPKFPGHG 
GGPRKSPFFGGLKGENSLTREKGGPINP 
NLAP P AF P PKGQNP GP F P KKKKKKKNRS 
S PAWNWDWIGTNAGK 


4613 


18514 


A 


4643 


3 


205 


IiKRFFCFSLGGQGCSEPRSCQCI PAWET 
QRDSISKKKKKKKTRPGGLTLEPPPFGG 
VKPKGPLGAGI 


4614 


18515 


A 


4644 


149 


1 


F FP I KADDVFFF F FLRRNLADS QGAVQ W 
HDLGSLQAPPPGFMPIiSCLSL 


4615 


18516 


A 


4645 


474 


344 


AIHEHYLPESSASGDCDEDLQAACTILV 
NALMDFHQRIAAHTV 


4616 


18517 


A 


4646 


165 


357 


LCNALSHLLPQCTPTPLISILCIIDTIS 
LLIQPITHAVRLSANITACHLLMHLILL 
STLSICTL. 


4617 


18518 


A 


4647 


2 


339 


ARGENDLSPGGGGCSELRLHHCTPTWAT 
RVNC VS Q KKKKNQ P KP KPNQTKPNQTRK 
GSPGFPLLNLKVISPEKSLNPNILKTAP 
KVTFPJSTWIjTPE I PP FWKMEEKGS FDPR I 


4618 


18519 


A 


4648 


313 


187 


LTSGDPPASASQSAGITGVSHHALPPLV 
QFPSHSLTPTPLFC 


4619 


18520 


A 


4649 


236 


324 


GQVMWPGMMAHACNPSTLGGRSRWIVRS 
R 


4620 


18521 


A 


4650 


225 


313 


GQVMWPGMMAHACNP S TLGGRGRW I VRS 
R 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H^Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q-Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possibIe 
nucleotide insertion 


4621 


18522 


A 


4651 


240 


76 


FCVHAHTHTHTPS FQASGSSEHMHTYTH 
PSGSPEHTHTYTHTTFLASRSAEPQLP 


4622 


18523 


A 


4652 


273 


379 


KQLAAQCVRPLSPSQAIASPVTCTYMPR 
RTPEVTEE 


4623 


18524 


A 


4653 


369 


189 


GTVKQAGAQCRNLGSLQPTPPPGFKQFS 
CKSLYSIMICYSKDCTMSTTLGIRANNR 
LHL 


4624 


18525 


A 


4654 


168 


346 


DRICRASSYTIHLMLYFVFLIFIjFYWYI 
YFLFFVL FHFT I F 1 1 LFYS F 1 1 LIiLLL Y 
FYN 


4625 


18526 


A 


4655 


56 


354 


WKRTFPFFFFLETGSGFASRPCTLLLFL 
YSLGPNPRVLIRVFLFLSSSRCYYYYLF 
LGLSLLFLFFYIMYLLFIRLFFFVLSGL 
LLLLSSYYRFIIFFF 


4626 


18527 


A 


4656 


50 


186 


SACLGLPKCWDYRHEALRPAMYLIiTGSS 
YVAQAGLNLLGSADPPE 


4627 


18528 


A 


4657 


131 


366 


DHFVASSESVKEEGLFFVFVFFFFIFIF 
FFILYICFFYFKFIFIILLYIIIIFIFI 
FYFFYLYYLILIFFLIFFIFIG 


4628 


18529 


A 


4658 


290 


155 


LEY I VEKGFHQDDKAS PELLTSRYLSDS 
ASQSAGI TGHRAWPTKG 


4629 


18530 


A 


4659 


163 


372 


LRLFALFAFLFMFFFYFLFLLYVFCFVF 
FLFFFFTFFLYYFFLYFFLYFLCSLVFL 
FFICFIFFWFFVFF 


4630 


18531 


A 


4660 


227 


378 


FIFFSHCTSTAIiVHPLFYLSRLVLSKNR 
NKF I GRARWLT P VF PALWE AE A 


4631 


18532 


A 


4661 


3 


141 


RSRHCTPAWQQSEILSQKERKKERKKKK 
ERKKEKERKKERKKKKKK 


4632 


18533 


A 


4662 


346 


3 


TFKFLGGVCPFFS PP KKRVFS KNS PGGF 
FLPPFLGENFFFAPPVNFGPPGGFFKGG 
PPFFFFFFFFFFFFFFFFFFFFFFFFSQ 
DSFDNLGIFVIPYKIQDDFSSSMSNATS 
ILIKIAL 


4633 


18534 


A 


4663 


302 


3 


GD INRRSHS PFVWS KP FAPCFMS S LDVF 
YKSSDFFYFFFETEFHSCCPGGISAYCN 
LCLPGSSNSPASASQVAGITGVRHHTQL 
IFASWETFKITYKN 


4634 


18535 


A 


4664 


2 


76 


RLAP T PQLGGH WP P TG I TPLNPIiE V 


4635 


18536 


A 


4665 


1 


210 


S P TRS P KP T P P Y YQTTLAKP FT KKKKKK 
KKKKKKKKKKKKKRGGAPliKKS PGGPKI 
NRGKKKKFFFLKGG 


4636 


18537 


A 


4666 


2 


356 


GSRPGGGKRTARRWTRPWRLRGACLGTG 
MAFAFHHQVI YKI KSFLRKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKPRG 
GGFLKKLWGGKFFMGGGKIFFFFIRGGK 
KKMWGG 


4637 


18538 


A 


4667 


1 


323 


RTRG I F C YD VCVE SGCADI QLLLL CLKK 
KKKKKKKKKKKKKKKGAPLKKNPGGAQN 
NPGKKKKNFSPKRGPKKTPPGNFEKKPF 
FGGGPNGAPPPQKNKTPKEKKKF 


4638 


18539 


A 


4668 


167 


446 


E LNKGL CKP KKKKKKGGGRS QFKPTRPK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KS 


4639 


18540 


A 


4669 


321 


52 


GVFSPFSPKKFFFSPKGLNFWGGFAPIF 
SPPKKRFFSKNPQGGFKYPPLRGKILIW 
GPRENWGPPGGFLKGAPLFFFFFFFFFF 
FFFFWG 


4640 


18541 


A 


4670 


82 


359 


NAYGI I L I KGNEDTKTVKWKSS CQLANT 
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SEQ ID 
NO: of 
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sequence 


SEQ ID 
NO: of 
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sequence 
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NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
G=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=G!ycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














WEFHIRALLGFNCFFLIFKRRGLTLLPR 
LDCNGVI I IHNNLESLGLSNPPMLASQV 
ARTTGLCY 


4641 


18542 


A 


4671 


249 


60 


LLIFIYYFYFFLFFFLFFFFFFFFFFFF 
LFVQVHLPVHLPCYDLSPLYKCVGVLVK 
CPFKYT 


4642 


18543 


A 


4672 


2 


319 


AAASTLVF VFS FFW VLGF FFFFFFF FWG 
PPKKILKKKGGGGGGGGGGPRGGPRGGG 
KRNPGGQKKGIGGANGGQKKERNFQGGG 
PPRGEKRGGGGGPGNPKGGGLK 


4643 


18544 


A 


4673 


3 


451 


HESFNPVI ISQLERGEDPWVLDRKGAKK 
SQGLWSDYSDNLKYDHTTACTQQDSLSC 
PWECETKGESQNTDLSPKPLISEQTVIL 
GKTPLGRIDQENNETKQSFCLSPNSVDH 
RE VQ VLS QSMP LT PHQ AVP S GERP YMC V 
ECGKCFGRS 


4644 


18545 


A 


4674 


446 


189 


SISLSTLVNVCQLISLSDPASGFVDHVC 
CI FT FYS INFALILI FIiFYLGVLiVWAAI 
INYHRLHGLNTKHLLIMIWMVGSPRLSC 
G 


4645 


18546 


A 


4675 


26 


170 


HLWFLV I VAHWAS S RQ IIS HW KNCTRH 
DCPVCLPLKNASDKRNQQSK 


4646 


18547 


A 


4676 


3 


378 


HEDEFDKMGNQHQAIiliEAMEQQS ISLAK 
AGWC I LPARTS 1 1 AVAHS FAGLYL I FN 
T VS WFFFFSNF I FI» ILSLFN YFCI S I Y 
ILFLIIFIFIFISSLIVFFCLISFRFLL 
FIFFSALSRIFIY 


4647 


18548 


A 


4677 


18 


242 


PIGLALSPRLECSGVIIAHLLGSNNPPT 
SGSRVAGTTG IHHHAQPHCFNC I FPMS S 
EVECLFICLLiLIHIIiSRA 


4648 


18549 


A 


4678 


288 


2 


PKYRWVPEKPTTASENAEIPSERQKEGI 
KLTIRISSRKKKPDSPPKVLEPETKQEK 
P E KE E E KTNVGRT LRR S PR I S R P P AKVA 
QIRDPKAHKKK 


4649 


18550 


A 


4679 


340 


208 


S VGQAGLELL TS GD P PATAS QSAG I TGL 
SQCYPPDSVPETFLPL 


4650 


18551 


A 


4680 


4 


379 


EENTG INLHDLGFGNGFLDMAPKAQATT 
AKKKKKKKKKIKIKSFLFLGGLFKKGKK 
TPPKGEKNFGNFKFWGGDLTPKI FGDPG 
AS TKKKKP FL KGGRGL KKTF F KKKKKRG 
PHPLESPSPFSVF 


4651 


18552 


A 


4681 


1 


350 


SRFSGAKEAKMEKKITGYTTVDISQWHR 
KEHFEAFQSVAQCTYNQTVQLDI TAFLK 
T VKKNKHKF YP AF I H 1 LARLMNAHPEFR 
MAMKDGELVIWDSVHPCYTVFHEQTETF 
SSLW 


4652 


18553 


A 


4682 


365 


45 


S PKWFVAS YDP YAGPGRPTRPDNSRDET 
GS HS VS ESS SGSDS KRE S TS S DRE ANE P 
SHRAS PE PE P P PTNKWQLDNWLNKVN 


4653 


18554 


A 


4683 


238 


372 


SATPLISSFSFIiFFFFFFFREGVWLCHP 
GWSAVSRDCTTAAVWPG 


4654 


18555 


A 


4684 


3 


462 


KTKESREAVEKEFEPLLNWMKDRALKDK 
IEKAWSQRLTESPCALVASQYGWSGNM 
ER IMKAQAYQTGKD I S TNYYAS QKKTFE 
INPRHPLIRDML.RRIKEDEDDKTVLDLA 
VGLFETATLRSGYLLPDTKAYGDRI ERM 
LRLSLNIDPDARG 


4655 


18556 


A 


4686 


410 


0 


FFFFFFFFFFFLLGKGGFGI IKFL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X~Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


4656 


18557 


A 


4687 


151 


2 


IFFFFFDRVQWHKLGSLQPLPPRFQQFS 
CLSLPSSWEYRHAPPRVRPRV 


4657 


18558 


A 


4688 


342 


183 


FFLFLRWSFALVPQAGVQWCDLGSLQPL 
S PGFKRFS WAE VGGLHE PRS QRLQ 


4658 


18559 


A 


4689 


126 


404 


NCQVWPLRFTGHSVFSVHPQWFILFYCF 
ALYF I FLEMGS YS VTQAGVQWHSLGSLQ 
PPQWSYVLTFLWHPSSWGYFVQNHLTPC 
FDGKVPWV 


4659 


18560 


A 


4690 


2 


128 


GESLFYIVSPRDWVAKERDQDDHIDWL 
LEKKKYEVPLILLL 


4660 


18561 


A 


4691 


407 


2 


YFFFPLPRMKIFFFPHQKGFFFGGCPII 
S P P PNE VFFS P KEGGF F F S P P KKRVF FH 
NPPEGFFPPPPFFMSPPPPFFFFFFSLF 
FFFFFFFFFFFFFFFFFFFFFFFFFSFR 
IILLNQKKNLSACVELRSCCTFM 


4661 


18562 


A 


4692 


1 


176 


SRRLNPGGRGCSEPRSHHCTSTWSTERD 
S VS KKKKKRF FAL KFLGE ANVKP T I GG Q 
KN 


4662 


18563 


A 


4693 


1 


403 


S KAE I GRRLGLLRQTVS Q WNAKEKFLK 
E VKS ATPMNTRM I RKRNS L I ADMEKVL V 
VWIEDQTSRNI PLSQSLIQNKALTLFNS 
M KAERG VEAAEE KFE AS RG WFMRFKE R S 
HFHN I KAQGEAAS ADVE AAAS Y 


4663 


18564 


A 


4694 


343 


425 


HCFLS LQLNS FEQLCINYTNE KLQQLF 


4664 


18565 


A . 


4695 


395 


2 


KNKGLFFFFFLRWS FTLVTQASNVTTAV 
CNVTILAHCNLHLPGSSNSPASASRVAG 
ITGSCHQG 


4665 


18566 


A 


4696 


429 


0 


VTKWC YI TKKKKKKKKKKKKKKKKKKKK 
KKARG 


4666 


18567 


A 


4697 


371 


217 


IPVFKQSSCLSFPRSWDYRHMPPCVTQK 
KKIEYQHISLYKKPSKRCALFLI 


4667 


18568 


A 


4698 


415 


59 


LQQQCECFISELTKKKKKKKKKKKKKKK 
KRGGGLE KKKKKGFFAP YP PH 


4668 


18569 


A 


4699 


431 


0 


LKKIFPPGGRVLSPSPPPPPPPPPPFPP 
P 


4669 


18570 


A 


4700 


371 


245 


TLGDPPASASQSAGITGVSHHAQPLFGI 
FIYKFTYICRKTRT 


4670 


18571 


A 


4701 


410 


0 


KQCWFTQKKKKKKKKKKKKKKKKKKKKK 
DRG 


4671 


18572 


A 


4702 


340 


402 


P FAGGG YRLGAAPEEES A YVA 


4672 


18573 


A 


4703 


188 


2 


TSLPKCWEYRHHPPCPAHTLLIFLFFIF 
VDMGS CHVAQVALELLGS S D P PTSAS QS 
AGIIGM 


4673 


18574 


A 


4704 


423 


34 


LKKKKKKKKKKKKKKKKKKKKKKKQKDL 
KKMVGGGNYKNVGGAQILGYKKLCFS FF 
YRVF 


4674 


18575 


A 


4705 


2 


297 


PRVREREKEMASMKEEFTRLKEALEKSE 
ARRKELEEKMVSLLQEKNDLQLQVQAEQ 
DNLADAEERCDQL I KNKI QLEAKVKEMN 
ERLEDEEEMNRSLP 


4675 


18576 


A 


4706 


2 


179 


QENGMNPGGRACCEPRLRYCTSS WATER 
DSVSKKNEKKRSASLRRFDILFGALTGR 
KAS 


4676 


18577 


A 


4707 


432 


578 


PRIVCFQELRKPSYAEICQRTSKEPPSS 
PLQPSSSSSSSS 


4677 


18578 


A 


4708 


419 


35 


GSFFFFFLKDQPKKGRKKEGCPPPPPKK 
1 1 YP P P ATKWGGGGGKKP P P P KKRGPPK 
KNRGRKKKP P KKKKKKF P P P P KKRAPQK 
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SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=AIanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F-Phenylalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, HNHistidine, I=IsoIeucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L~Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














KKKKPPPP I FFYLYFFKNF I FFFFFFF Y 














FFPFFFFKFIFNLKKF 


4678 


18579 


A 


4709 


147 


31 


NFFFFFFFFFXFXFFFFFXXXKFFLFFF 
* FFFLKLKIFF 


4679 


18580 


A 


4710 


3 


229 


HASAHASDQPERLNAGTYFLFYTLVGSL 
PLL I AL I YTHNTLGS LN I LLIiTLjTAQEL 
S KKKKKGGAVL KNP WGAQ S 


4680 


18581 


A 


4711 


1 


158 


PTRPPTRPRSCSELRSHHCTPAWVTERL 
CLKKKKKKFPNLSVGFKILG I LKG 


4681 


18582 


A 


4712 


163 


404 


KKKKKKKKKKKKKKKKKKKKKKKKKKKP 
KKKKNKKKKKKKKKKKKKHPS RGGAQ KK 
KKGGGAQNRAGGKRRPSRGGGEKQN 


4682 


18583 


A 


4713 


3 


378 


FEEPVYIKSRQKRKSSNPPKLVSSQPHG 
LHDFFKKKKKKKKKKKKKKKKDGGGPLK 
KKPGGAKNKPGGEKKNFFLKGGGKKNPW 
GFFKKKTFFGGGKIGAKPPKKKKSLKKK 
KKFLRGKGGKKT 


4683 


18584 


A 


4714 


2 


208 


VSNPAVSVPHLPFSVYKSS PMASMTFSK 
KKKKKTKKKKKKKKKKKKKKKKKKGGPQ 
KKKPWGAQNKPG 


4684 


18585 


A 


4715 


290 


56 


SAPPPIFFLFFFLFLFIFFIFFFFFKQI 
liEKKGGPPPFFLFFFFIFFFFFFYRMGC 
DRWWFILIQGTFHRKQKPVLKV 


4685 


18586 


A 


4716 


50 


424 


GGFKI KFLiFTQFLiFLLjFS PSMFIilSKS P 
AYLWQ S S P KQVPMWFGKTNFLVS IECK 
KEGIYFFClPFVPFGFPHPKIIiCSLLPF 
LAFLRRS IiALLPRLECSGAI LTHCNPCL 
PGSSNSHAWA 


4686 


18587 


A 


4717 


161 


2 


GRPGPADFRVRPQLLQRFIiF I YLFTEME 
SCSVTQAGVQWCNLGSLQPLPPGLQ 


4687 


18588 


A 


4718 


2 


115 


VYTTAMAI EE YPEAWGSGVE VLVS VLVG 
LAMEVGSTRP 


4688 


18589 


A 


4719 


1 


416 


GNQGGGYGGGYDNYGGGNYGSGNPSSSS 

ssssssssssssssssssssssssssss 
sssssssssssssssssss 


4689 


18590 


A 


4720 


1 


454 


QPDSGISSIRSF KT I L P AAAQD V Y YRDE 
IGNVSTSHLLILDDSVEMEIRPRFPLFG 
GWKTHYIVGYNLPSYEYLYNLGDQYALK 
MRFVDHVFDEQVIDSLTVKI ILPEGAKN 
IEIDSPYEISRAPDELHYTYLDTFGRPV 
I VA YKKDTLi VEQ 


4690 


18591 


A 


4721 


3 


173 


DAWAGFHYVGQAGLELLTSSHLPASASQ 
SAGITDVSHRTQLDVPFSFSLSPIjPHQC 


4691 


18592 


A 


4722 


2 


201 


L KPGGGG CS EP RS CHCS T P AWRQ S ET P S 
Q KKKKGKWGIjER I G WGGS T TNNGAKNQR 
GEKFGKKGGF 


4692 


18593 


A 


4723 


2 


183 


LKPGGGGCSEPRSCHCSTPAWRQSETPS 
QKKKKGKWGLE 1 1 CWGGSPTNKRATNS P 
VEKF 


4693 


18594 


A 


4724 


1 


395 


QDATS P 1 1 EEL I T FHDHAL 1 1 1 FLICFL 
DLHALS LTLPSALT 


4694 


18595 


A 


4725 


388 


3 


SFSPLEENLGNTIQDIGRGKDFTSKTPK 
AMATKAKID 


4695 


18596 


A 


4726 


223 


44 


WLFFFXPLFFFFFWGGIIFFLLKKLYIF 
FFFMSKIFFFFFFFFFFFFFFFFFFFFF 
FFFF 


4696 


18597 


A 


4727 


41 


374 


KKKKKGKKKKKGGGGGGNPKKKKGKMEK 
P PGGNKRGEGGKKKNFLKKKGGGLFKGK 
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eth 


NO: in 
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ed end 
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peptide 


od 


USSN 


nucleotide 
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E=Glutamic Acid, F=Phenylaianine, 
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sequence 
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26 
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peptide 
sequence 
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location 
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ding to 
last amino 
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peptide 

sequence 


G=Glycine, H=Histidine, I=Isoleucine, 
K==Lysine, L— Leucine, M-Methionine, 
N=Asparagine, P=ProIine, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=VaHne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /= possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KP PKKKNGGRKPWKKKNFKKKKKKAPFF 














LGPKI FKKKKKRGLKKKLGFFSQSVGP 


4697 


18598 


A 


4728 


323 


69 


FFFLRQSLGSASQAGAQGHNLGSLQPLP 
AGFQLFLQPPPPGSKQFGRLNLLGNKDL 
WGLP VLRAH PQAP PLAF I L 


4698 


18599 


A 


4729 


3 


197 


CLALLHLSRQRQYALPLLGLPGCPHIHT 
HSHTHAHDHTHYTS PHAHRHAHTHTHMH 
TLPYTTHCI 


4699 


18600 


A 


4730 


309 


443 


VLTLSPRLECSSMNIAHYSLKKLGSSDH 
PTSASQSGGIIGMSYHA 


4700 


18601 


A 


4731 


3 


172 


DAWAFFNGPGVSLCCTGWSRTPQLiKRST 
CLSLPKCWDYRPE PLS PAP KTLFS YVLL 


4701 


18602 


A 


4732 


98 


232 


QTVFLLIWLLALCSPGVLQTVKRFLEFR 
CIFSLGIVTFICLYRN 


4702 


18603 


A 


4733 


238 


1 


LGPPPPSPLGPPPFLVIFRGFSFPKTPL 
FFSPFFFGPPPFFFFFFFFFFIRRSFAL 
VTQAGVQWRDLGSLQPLPPGFKR 


4703 


18604 


A 


4734 


121 


424 


LIQGHRWIYFSQLRKVRGRQACDSTLL 
GHLPKWRRMFPFLSFTVAGLEPTSHYR 
MFVDWIiVDQHHWRYQTGKWVQCGKAEG 
TMPGAR 


4704 


18605 


A 


4735 


319 


8 


NPPQKKKKKIFFPPKKKTPPKKIKKNPP 
PFTIFFFFFFFFFFFFFFFFFFFFFFFF 
FFYFQHWFCFEI IHVIFFLIEVLFLAF 
HVGDS IADAWSTKNSNRTK 


4705 


18606 


A 


4736 


297 


442 


TQGFFGGGPRKIGFFFRGLKNGGGGNPK 
LRFLKPGQLVGRGKTKKNQG 


4706 


18607 


A 


4737 


430 


0 


CVLGSKDKKKKKKKKKKKKKKKKKKKKK 
KAKKKDS 


4707 


18608 


A 


4738 


137 


1 


INYYFFLKQGFPLLPRLECSGAI IAHCS 
LEFPGSSDPRPSASQVA 


4708 


18609 


A 


4739 


188 


3 


ISPKALNPRREVGPIFPPPKKKVPPQNP 
PGGFFTPPHKEKNFSLPPPVNLGPPKDP 
LKRPP 


4709 


18610 


A 


4740 


188 


2 


TS LPKC WD YRHEPPCPAHTLL I FLFFI F 
VDMGS CHVAQVALELLGSSDP PTS ASQS 
AGIIGM 


4710 


18611 


A 


4741 


410 


1 


RRPGGGFCPPPPPKIFFSPPPPPFFVGV 
LPQTPPPPKKFFFLNPPPGFFFPPPKKK 
KIFFFSPPFFSPPPIFFFPPPPPFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
GKDRVSLCCSDWSRTPGPKPSSLL 


4711 


18612 


A 


4742 


225 


444 


ENLKLKSFPCSHFHWLRWFNLTIGALSR 
S FDEAALLFFQNDTYWPGTVAHACNPST 
LGGRGGR I TKSRDRNHS 


4712 


18613 


A 


4743 


2 


452 


VSNPAVSVPHLPFSVYKSSPMASMTFSK 
KKKKKTKKKKKKKKKKKKKKKKKSGGPQ 
KKKNR 


4713 


18614 


A 


4744 


3 


144 


LAAVLL KLGGYG I IRLTLILNPLTKHIA 
YPILLLSLNPDIITGFSS 


4714 


18615 


A 


4745 


206 


2 


IQNPQTERVNSTSKLKISSKDTINRAKK 
QPMDWE VI CKKH I FGKELISRMYKELQL 
NNKKQIVPPTRP 


4715 


18616 


A 


4746 


523 


178 


RHRRFFMDIEREQVKEQQRQKEQKKDIE 
KIKDKREQEGYAAEQRILRMNFHEDPYS 
GEKLSE ILAQLQLQE IKGTREKQQREKE 
YLRYVEALRAQIQEKMQLYNITLPPLC 


4716 


18617 


A 


4747 


3 


518 


ENRLNL EGGG CGE PRS RHCTP T WAT 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 


4717 


18618 


A 


4748 


491 


145 


LCHVGQAGLELLTSSDLPTLASRSAGIA 
GVSHRTRPHSHFYF YFLYSS S FFFFFFL 
AQRGNRAP RGKKT P FGLAT LN I KSNGGV 
FWFLFLGFGFLKREPTFALGLiKDRGGNL 
V 


4718 


18619 


A 


4749 


6 


362 


NKLPNGTEFLSSLVPNTFFFSFLFFPFF 
FFFFFWGKGGPPRGGGAKFYPLEFPPPG 
VPPPPKGGKKGGPPPPTNFLFFKKEGGS 
PFGPGGVKTPGPKGFPPPSPQRGGKKGQ 
WAQKPP 


4719 


18620 


A 


4750 


380 


3 


SLLLAFALLCLPWLQEAGAVQTVPLSRL 
FDHAMLQAHRAHHIiAIDTYQEFSETYI P 
KDQKYS FVHDSQTS FCFSDS I PTPSNME 
E PQHKSNLELLRI S LLL I ESWLE PVRFL 
RSMFANNLVYDPS 


4720 


18621 


A 


4751 


1 


345 


LF YI FKKTWSLFLCFLWMMCVF I FFFFL 
LFVFFLCLGGILVSFIPPGGEKREKKKR 
RGGE KKKKKKNWF KKKKKNP WGGGGG L 
KKQRARGGKKKKKKGGGGSGGGKKKQS P 
LRG 


4721 


18622 


A 


4752 


1 


407 


QEFEETYIPKDQKYSFLHDSQTSFCFSD 
S I PTPSNMEETQQKSNLELLRI SLLLIE 
SWLEPVRFLRSMFANNLVYDTSDSDDYH 
LIjKDLEEGIQTLMGVRVAPGVTNPGTPL 
ASRAGGEKYCCPLFSNKARDQEN 


4722 


18623 


A 


4753 


357 


1 


PLFFFKRNLFFFFFFTKKKEKKAPVFFF 
FPPPSREWGGFFFRVGGVNTQFLHPFFF 
SQNTSVFFSTLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFLFYCFIYFSR 
DRGLTLL 


4723 


18624 


A 


4754 


1 


207 


LGLAGVE E E VAS RLNAGGRG C S KP RSHD 
WTPAWAMEQDSMSKKTKNEYQRRQLIHL 
QS STSGFLYVHEQ 


4724 


18625 


A 


4755 


2 


336 


HEERERERERERERERERERERERERES 
RAQ FRAR KNS QAPH P RVCE T L WTE EGG S 
FYPLTEKDFAQCERAHFCVLGVSHPPGW 
GLCKKMNPE CERATHS LHGDAD I SGVL 


4725 


18626 


A 


4756 


230 


446 


VGFSSLFSRYCTILISASSLTKELCEKK 
RSFLFCFAFWLLGFLRQSLALSPRLEYN 
GAISAHCNLHLPGSSD 


4726 


18627 


A 


4757 


3 


390 


QTSFCFSDSGPTPFNMEETQQKSNLELL 
RI S LLL I E S WLE PVRFLRSMFAWNIiVYD 
TSDSDDYHLLKDLEEGIQTLMGRLEDGS 
RRTGQ I L KQT YS KFD TNSHKHDALL KN Y 
GLLYCFRKDMDKVETFL 


4727 


18628 


A 


4758 


234 


387 


LAKI FLS FSSATGW I ENRPQS PATGRTP 
VFVSPIPPPLPPPFPFAFVT 


4728 


18629 


A 


4759 


396 


0 


I YTHTHTHTHTHTYI YMYVFMI I 


4729 


18630 


A 


4760 


176 


384 


MVr PLFNKDTQHI ILFWGYVLIFFFLKR 
NF VF VAQAGGQGRYFGS LKP P P PRLKP F 
SCLSLLGTWDYRR 


4730 


18631 


A 


4761 


419 


208 


IYPPPPPWEPKGGSFLKKKKKKKKPGPG 
VPPFIPPPLGGQAGGFPNSKIQIPPAPQ 
GKPLFFQKNQNYLA 


4731 


18632 


A 


4762 


158 


2 


GNQLNP P P P SGSAF CFLFFLRRS LALVT 
QARVQWHDLGSLQPSPSGFKRFSC 


4732 


18633 


A 


4763 


489 


398 


AGFELLTSSDPPASASQSVGITGMSHCT 
QP 
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NO: of 
nucleotide 
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NO: of 
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M 
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od 
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NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny|alanine, 
G=Glycine, H=Histidine, l=Isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine> P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 


4733 


18634 


A 


4764 


420 


1 


FFFFFLGEIFPPGGKGGVFPPPPLKKFF 
FSPRGLFFLGGGGPNFPPPKKRVFFKKP 
RAGFFPPPGVLKRGPGPGFLTPPQKEKN 
FFFPPPGVLGPPKGFFKGGPPFFFFFFF 
F F F LGKKKKKKAL FF T KKKGGGGT VRKA 


4734 


18635 


A 


4765 


167 


3 


I Y FFFFLKKI FFYKKKINKLKKKKKKKK 
KKKKKKKKKKKKKRAAARDLELAHAS 


4735 


18636 


A 


4766 


424 


222 


FQNIHKKNFPSPPLLKTGPAPLFKKPPK 
KKKKYFFPPPEKLGPPREFLKGPPPFFF 
FFFFFFFFFFF 


4736 


18637 


A 


4767 


254 


423 


V I KS WARRNKNKWD W P GAVAL S CN P S TL 
GGPGQANPLRSGVQDHPGQHGETLSLLK 


4737 


18638 


A 


4768 


363 


0 


KTTTS QARGPTS VI PAAQE TEAGDS SNP 
SGYGSYSSSSSSSP 


4738 


18639 


A 


4769 


51 


169 


YYTHTHPRTGFVKKKKKKKKKKKKKKKK 
KKKKKKRGGGF 


4739 


18640 


A 


4770 


188 


3 


ISPKALNPRREVGPIFPPPKKKVPPQNP 
PGGFFTPPQKEKNFTLPPPVNLGPPRDP 
LKRPP 


4740 


18641 


A 


4771 


1 


131 


KLD KENALDRAE QAE AD KKAAARGKQL E 
DELGSLQKKKKGGPF 


4741 


18642 


A 


4772 


363 


452 


RLECSGMIIXHCSLNLPGSSDSPTSASQ 
LA 


4742 


18643 


A 


4773 


121 


1 


RPRRPVAQAGVQWHNLSSLQPLPPGFKQ 
FFHLSLLSSWDY 


4743 


18644 


A 


4774 


31 


226 


MISSQLSII PTPTQEQGLKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKGGGLLKKILG 
GAKFLGGEKKNFFFF 


4744 


18645 


A 


4775 


161 


1 


KPPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFLFFFFL.KKSWRP 
WGLKSAPAL 


4745 


18646 


A 


4776 


394 


67 


SPPRGEGGVAPPPPPKNFFFPRAVFFLG 
GGGGPMPPPQKRGFFPKTPGGVFKPPPK 
GGKFFFPPPGEGGPPPEFFKGPPPPFFF 
FFFFFFFFFFFFFFFFCFFFFFFFF 


4746 


18647 


A 


4777 


2 


409 


GVCIFSGHKHNKAPLPFSVLYSIQYINS 
LEKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKT 


4747 


18648 


A 


4778 


456 


0 


FFYKKKKKKKKKKKKKKKKKKKKSSS 


4748 


18649 


A 


4779 


438 


0 


LLKIDKKKKKKKKKKKKKKKKKKRTSKR 
GG 


4749 


18650 


A 


4780 


58 


330 


S P FYQLKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKP QQ KGGE KKKNPGE L 
GKKETCGGGGVLVGKPHKEKKRGGGKKKN 
FSG 


4750 


18651 


A 


4781 


435 


73 


FFPPPPRGKFFFKKPPGGKLFS PPELGV 
FFPPFPLKIFFFPSTPFFFWGGFPPFSP 
PPKKVFFPKFPPGFFFPPPLKKKFFFSP 
PPFFLPPPGFFFSPPPPFFFFFFFFFFF 
FFFFFFFSP 


4751 


18652 


A 


4782 


420 


109 


LFKKFFS PTEREGVFPPQP PKNFFFS PN 
PLFFGGGGAQNFPPQKKNFFPKNPPGFF 
FSPPKKKKKFFSPPGKILAPPGYFFKGP 
PPFFFFFFFFFFFFFFFFFF 


4752 


18653 


A 


4783 


230 


2 


LGLWGALYSPFLGGLGGRKFLLPWRPR 
VPLTPISPPPSRLGGPPKKKKSQAWWHA 
PWPTTREAEAGGSLEPRRL 
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peptide 
sequence 


M 
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Predicted 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIa1anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L-Leueine, M=Methionine, 
N=Asparagine, P=Proline } 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine> 
W=Tryptopban, Y^Tyrosine, 
X=Unknown, *=Stop codon, /=possibie 
nucleotide deletion, V=possibIe 
nucleotide insertion 


4753 


18654 


A 


4784 


271 


440 


VLYSKRDNNKSFKNNTYTGIGPGAVAHA 
CGPSTLGGRGGQIMRSGVRDKPYRHGET 


4754 


18655 


A 


4785 


2 


266 


KKYGIIIFCSFFLSLFLKDQGLAPLPRL 
ECSWFAAHRGLGLLGWSDPPKSAFLVA 
G I TGMHHHAWL I F KVQF SNLMG P WS YTW 
SAVD 


4755 


18656 


A 


4786 


216 


435 


HRSGDSSLiKKKKKKKKKKKKKKKKKKKK 
KKKKKIWSFWKGPFLWGGKKRKSGGG 


4756 


18657 


A 


4787 


249 


437 


MNNEWPLKTETSRPDWAHACNPSTLG 
GRGGQ I TRLGVQDQP CQHGEAPSLPENR 
KYLAWSV 


4757 


18658 


A 


4788 


107 


368 


AHGSLDFLPGGPKKSHQWFPKPKDGKPP 
PFSKKPL.KKKTPGKKGGGHPTPGARSKK 
GGYPRGGDPGGPFFFKKPGQKKKTPFKL 
RRD 


4758 


18659 


A 


4789 


1 


225 


PTRPF I TNNI FPAS P FQTT I PL YLKLTA 
IiAVTFLGLLTTLDLNYLTNKLKIKSPLC 
AKKKKKKKI KKKKKKKNEG ' 


4759 


18660 


A 


4790 


2 


216 


SGRNSRVDGRQEAACRDYQS SLEDLTFN 
SKPHINMLTILAEENLPFAKEIVSLIEA 
QTAKVFIHPAASYYF 


4760 


18661 


A 


4791 


389 


30 


EKKSPPQKKKKKFFPPPPQKKFFSPPNQ 
KTCGGGGPKKPPPKKKGSPQKNPKGVKK 
PPPKKKKKIFPPPQKKGPPPKIIiKKPPP 
PFFFFFFFFFFFFFPFFFFYFSLFFSGG 
FSYKIFL 


4761 


18662 


A 


4792 


164 


3 


LFPGFFNPGVKGEKGISFPPFFFFETES 
HSVIQARGQWRRLGSLHPLPPGFKR 


4762 


18663 


A 


4793 


550 


413 


AEMGFHHVGQAGLELIjTSS DPP ALAS QS 
AGI TG I SHRAQ PHRP I C 


4763 


18664 


A 


4794 


456 


352 


RSQDGLYLLTS S FACLGLPKCWDYRRE P 
PCLAVFF 


4764 


18665 


A 


4795 


242 


3 


KPFSGGGGAFQTAFFITFGKKFWSRLG 
LLKKKVFLKFLPQAFFFFFFETESHSVA 
QAGVQWRSLGSMQPSPPGFKRLS 


4765 


18666 


A 


4796 


218 


346 


GFSCFVFETVSLCRPGWSAWRSRLTAT 
SASRVQAI ILPQPPE 


4766 


18667 


A 


4797 


383 


80 


KFFFWQGIGGFFPPFPPKNFFFSLKPFF 
FLGGFSPFFPPPKKIFFSKIPPGFFFSP 
PLKKKIFFFPPRLILAPPRFFFKAPPPF 
FFFFFFFFFFFFFFRPI 


4767 


18668 


A 


4798 


1 


126 


ILIKLSSTWEGIQAGKELEEQHGIHCNM 
TLLFS FAQAVACAE 


4768 


18669 


A 


4799 


81 


356 


ILFFFFFFPGGGTPKKSQVFFKNPQNPI 
GKPTRMGKGEKPPP PRGKKKKKGGPPTL 
GKNKNPVPGWPGAFLPPLEEKKKPFPGP 
PPLKKKK 


4769 


18670 


A 


4800 


1 


99 


GCSEPRSRHCTLSWATRAKLHLIKKKKK 

KKKGA 


4770 


18671 


A 


4801 


238 


5 


NHQNS VQN Y FKNLR IFQLIYSLS MNLP R 
SSFKLKSMTHSISFSFLFSFFEIQSHSV 
AQDR I KWHDLGS LQ P QP PGLK 


4771 


18672 


A 


4802 


166 


62 


I FLFSFFLFLRDRVLLCYVGWSVMVQST 
QTPGLK 


4772 


18673 


A 


4803 


102 


6 


AASTTE S P S VAHAGVQWCDLGS LKPGSR 
RFQ 


4773 


18674 


A 


4804 


17 


374 


FGSRAATLFFFFFFLGRGGFFLGPRGPK 
KKTPGGEGFTRFFDPGWGKGPEKKTPPP 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














WGGKGGP KKMNLWGAGKI KFLE I KGKS F 
PRGTTPNFFGPPLFFSPPKTQGPGNFLD 
FSFKLKN 


4774 


18675 


A 


4805 


59 


175 


S FYFFLMMKKKKKKKKKKKKKKKKRGGA 
LKKKTGGGPN 


4775 


18676 


A 


4806 


406 


300 


SSGVLYSRAPTYFCGQTLTFRQVDIKTS 
HLPQEAE 


4776 


18677 


A 


4807 


123 


1 


KS IAWLGTMAHSCNPS ThGGQDGQTTRS 
G VPJD Q PGKHGE T P 


4777 


18678 


A 


4808 


319 


406 


LYKSYSSWPNWAHTCNPNTLGGQGGQI 
I 


4778 


18679 


A 


4809 


236 


351 


GFWPAPGVHACNPS ILGGQGGRVTGPKL 
GNLLGQHGEI 


4779 


18680 


A 


4810 


173 


2 


EVLFLGFRSILISLLQMNYLLYLIAKFA 
FVAQAGVQWHNLSSLLPPPPRFKRFSCL 
R 


4780 


18681 


A 


4811 


2 


94 


TRI L KVWQ KRKCS VKNGFLT I S HGT VS I 
LF 


4781 


18682 


A 


4812 


358 


2 


FFFFFSETES C S VAQAG VQWRD LGS L 


4782 


18683 


A 


4813 


362 


3 


YSEVCIISLQQAHDSLSVINFHLAECVI 
QLPVNLRELPGAVAHSPLGSELCFSDSG 
SVPSHTRSFPKAAIRLTANHRRGCTESG 
WPFSLVFLFERCMWPGTVAHTCNPSTLG 
AQGGQIT 


4783 


18684 


A 


4814 


341 


1 


WNQKRAHITKSILSQKNKAGGIMLPDFK 
LYYKATVTKTAWYWYQNRDIDQWNRTEP 
SEIMPHIYNYLIFDKPDKNKKWGKDSLF 
NKWCWENWIATCRKLELDPFLTPYTKIN 
S 


4784 


18685 


A 


4815 


2 


283 


VY-TGEE ILQKKE IT IRDQEAP YLL.RNLS 
DHTVAISSSTTLDCHANGVPEPQITWFK 
NNHKIQQEPELYTSTSPSSSSSSPLSSS 
SSSSSSSSS 


4785 


18686 


A 


4816 


266 


3 


AASTLQAHRAHQLAIDTYQEFEETYIPK 
DQKYSFLHDSQSLRQPLHTHTGSVGTRT 
LAASLSSGRTAHTRTHMHTQHTRKRTAH 
TNA 


4786 


18687 


A 


4817 


365 


1 


SFFFFFSETESRSIGRLEFSGTISAHCN 
LRLPDSSNSPASA 


4787 


18688 


A 


4818 


87 


372 


ASYKKDKYCMIPLIRVIKFPQTERKKWL 
SGLGGEGNKELLFNGCNMAVFLVLGGGF 
ETEPPSVTQAGEQWCDLSSLKPLPHRLE 
RVSCLSLPSGW 


4788 


18689 


A 


4819 


5 


175 


FAFLT I P LS RLFDNAMLRARRL YQLAYD 
TYQEFEEAYILKEQQDSFLQNPQTSLCF 
S 


4789 


18690 


A 


4820 


235 


351 


FFXFFFXFFXFFFFFFFFFXFFFXFXFF 
XFFXFFFFFXF 


4790 


18691 


A 


4821 


2 


155 


IFLLLIESWLEPVRFLKSMFANNLVYDT 
SDSDDYHLLKDLEEGIQTLMGRL 


4791 


18692 


A 


4822 


90 


2 


L FFFFFE TE S PS VTQAG VQWHDLGS LQP 
P 


4792 


18693 


A 


4823 


247 


3 


KVKGSPSHRRAMVAKGNLSLHERMKSPG 
ISNYMGNSKILFSYFKNCWPGTVAQAYN 
PSTLGRGGWITRSGVRDQPGQYSET 


4793 


18694 


A 


4824 


93 


407 


QHP KKKKKWGGRGPLKNKEKVFKHKT FN 
FFSRQKKKKKKKKKKKKKKKKKKKKGGG 
GLL.KKFWGGPKFPGGKKKFFFFFGGGKK 
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E=Glutamic Acid, F=Phenylalanine, 
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nucleotide insertion 














NPPGVFLKKNFFWGGEKKKKK 


4794 


18695 


A 


4826 


2 


129 


TTAHCSLNLPGSSDPPTSRSLVAGTTHV 
CHNAWVNFVFCRRG 


4795 


18696 


A 


4827 


342 


140 


GGFFFFPPPPFFFKFPLFLPPPPPPGPS 
CVGGVQKFTFPPPPQLSRSLFFLFFFAP 
PPETDNPFFFS 


4796 


18697 


A 


4828 


1 


259 


GGGCSEPRSCHCTPAWVTEYDKKKKKKG 
RGKGKKKIGKGGFQRKFKGPNPHGRFLR 
ESHNSPKKGFPKKFFGNPPPGTSKKVPL 
PH 


4797 


18698 


A 


4830 


125 


1 


KGTRTFETESRSVAQAGVQWHDLGSLQP 
LPPGFRRLTCLSL 


4798 


18699 


A 


4831 


1 


323 


ARGEREREREREREREREREREREPPPL 
SRGGSESYLKSAWRERLLLERDPLFFFG 
GKMGNLFPPPPPPVGRGYTGGGLITGGP 
FWWGE CVSHTHHTHTHTHT YMS V 


4799 


18700 


A 


4832 


3 


297 


TRRERERERERERERERTL 


4800 


18701 


A 


4833 


1 


150 


AREREREREREREREREREREREKKKKN 
HSRGGGPGPQFPLGGGVKKFGGGV 


4801 


18702 


A 


4834 


1 


98 


ARGERERERERERERERERERERERERE 
RERGAGGGPT 


4802 


18703 


A 


4835 


2 


66 


HEERERERERERERERERERAAR 


4803 


18704 


A 


4836 


2 


73 


PEERERERERERERBRERERETAR 


4804 


18705 


A 


4837 


146 


2 


FGLVYS ISQFCLFFVRRSLALVTQVGVQ 
WRNLGSLQALPPGFTAFSCL 


4805 


18706 


A 


4838 


354 


0 


PPPPPPSPPPPPPPPPPSSPPPPSPPSP 
SSSPP 


4806 


18707 


A 


4839 


62 


358 


GIjVHD khs PHGG cg s hhvl p s lnp s f p t 

FSTKS PSVPPARALWPPDSRWALQVGHL 
GLT P T TRQES AQ APRRKLALLEGS FQGQ 
AQWLTP VI PALWEAE 


4807 


18708 


A 


4840 


121 


13 


RPGRPPPEFTPFSCLSLPGSWDCRRPPP 
RPANLLYF 


4808 


18709 


A 


4841 


3 


221 


LVMGF VGFSSKPS P I YGGLAL I VSGWG 
C V I I LNFGGG YMGL I VFL I YLGGMM VG F 
GCILDFSLNFNVGLNF 


4809 


18710 


A 


4842 


3 


84 


CHCTPAWTANLDS I KKKKKKRKKFKN 


4810 


18711 


A 


4843 


33 


230 


LTVT I S KAAE VLKHKKKKKKKKKKKKKP 
KKKTPKKNPNPPPQKGELGKKIFFFFWE 
RFLPQKKRP 


4811 


18712 


A 


4844 


3 


368 


HELNPRGRSCGEPRPCHCTPAWATRAKV 
HITKKKKKKKGFNLFKPGGSLSNFFPKK 
KETPWSPQNFEKKGPRRFSPRGKNPPLA 
LRGKKKGGKSLSGKIFPPFPRGKTPPWA 
FMGTQKGEYS 


4812 


18713 


A 


4845 


163 


2 


HHGHS I S KFLIiTVS S PKS FNQLRPDAVA 
HACNAS TLGGRGGW I TRSG VQDQLV 


4813 


18714 


A 


4846 


387 


234 


KGCGEPRS CHCPPAWATIAKLLFKKKKT 
KPKKNPKKQEKTFPYNMRVCY 


4814 


18715 


A 


4847 


189 


349 


KFSRFSGQKALLLKDILRLGTAAHTCNP 
STLGGQGRWIMRCQVFKTSLAKMVK 


4815 


18716 


A 


4848 


3 


95 


HENF YCPGSALGRNEGNI FPNPEATFVK 
EM 


4816 


18717 


A 


4849 


2 


325 


DSHTI IMGDFNTPIiAILDRSTRQKVNED 
IQDLNSALHQADLIDIYKILHPKSTEYT 
FFSAPHRTYSKIDHKIYSQGNHETDCRY 
GKKGGGWKDERFPDVDCEKFKSQ 
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peptide 

sequence 


Predict- 
ed end 
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location 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vaiine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \= possible 
nucleotide insertion 


4817 


18718 


A 


4850 


241 


348 


SFFSQTQASEQLTKSKQGLALRTLPPVQ 
FPPPPPRP 


4818 


18719 


A 


4851 


2 


176 


LVETGFCHVGQAGLELLTSGECWDYRCE 
PPSPAYCFLHTIVTIYPLLCLAFFSYLC 
DC 


4819 


18720 


A 


4852 


2 


191 


VGRVGLELLAS sdlptsgs ps agitgms 
HCPWPNFLLIFIGYFFTVLTPCEYYSGG 
MTVPLWL, 


4820 


18721 


A 


4853 


67 


376 


IKCTTFNIKKKKKKKKKKKKKKKKKKGG 
ALKKKPWGGQKKTGEKKKKFFSKRGGKK 
KRGPGRVLKKGEREKIFFWNFLKKNFFW 
GGGNLGQPPPKKLRALGKK 


4821 


18722 


A 


4854 


278 


363 


IWPDWAHTCNPSTLGGQGRWTMRSGVR 


4822 


18723 


A 


4855 


3 


282 


RGCSEPRLRYCTPAWATEQDSASKKKKK 
KKTQ I FWEGPGGGVS PFKPHFFKGPGGK 
KPWGQKFKNPPPQWEKPLFFFKKKKTKF 
SPGGGGPP 


4823 


18724 


A 


4857 


382 


224 


PVIQLIGGVRQEKNFNLGVKGFNEPKSP 
P C P P T W VAKQNS L S KKKKKK I GKK 


4824 


18725 


A 


4858 


392 


102 


FFFFFFLRQGLSWQARVQWCFCSNLGL 
LG S SNP P T S AS L S C F LNKKE YRWENVTN 
KV 


4825 


18726 


A 


4859 


3 


111 


FHRISQEGLILLTLGSAHLGLPECWDYR 
REP PR PA 


4826 


18727 


A 


4860 


282 


83 


AQ I YKVQNWGS SGNKCI CVC VC VCVC VC 
VCIYIYMLCMSRFKYKCIVGCMNIDSYV 
VYIPNIQCYI 


4827 


18728 


A 


4861 


3 


417 


NHSNLGGRGCSDPRSRHCTPAWVTERDS 
I S KKKKKKKKK I FFGLREKKL TL FP FLG 
KKNENSGFPLNPNFFFGGKGKKATLGPL 
GLVLRGVPFFRGGETQNGENQKPGTTLE 
KFEKHFWVGPPPLEKISPHKGKQTQK 


4828 


18729 


A 


4862 


1 


88 


MEEMYKKAHAAI LENPVYEKKP KKKKKK 
K 


4829 


18730 


A 


4863 


274 


369 


VSLCCAGWRAAVRS Q I TAAL I SRAQAI L 
PPQP 


4830 


18731 


A 


4864 


37 


444 


DPRVRSEKKIPFKILLLIDNAPSHPKAL 
ME1YEEINVIFMPANTTSILQPMDQGVI 
S TFKS YYLRNTFHKALAAMDSDVSDGSG 
QSKLKTFWKGFTILDAIKNIRDSWEEVK 
L S TLTG VWKKL I P TL I DD YEG FKT 


4831 


18732 


A 


4865 


341 


210 


VYMCHVGLL YHQ P V I YIRNR I E AS E I T P 
HIYNYLIFDKPSHQF 


4832 


18733 


A 


4866 


206 


412 


QLCLS QACTLARGNAKGFLKY IHRNNVS 
MP S VAGHT WGPD Q LVKGQ GE T C PAT I HF 
SDSPSLKYTMYPM 


4833 


18734 


A I 


4867 


248 


78 


SHFEAGLAAQLTVSWFPQGTPVFVHAGP 
FANIAHGNS S I I ADR I ALKLVGPEGFVG 


4834 


18735 


A 


4868 


637 


2 


E FDGRWPRQ I VS S I GL CR YGGR I D CC WG 
WARQS WGQCQP VCQPRGKHGE C I GPNKC 
KCHPGYAGKTCNQDEHI PAPLDQGSEQP 
LFQPLDHQATSLPSRDLNECGLKPRPCK 
HRCMNTYGSYKCYCLNGYMLMPDGSCSS 
ALTCSMANCQYGCDWKGQIRCQCPSPG 
LQLAPDGRTCVDVDECATGRASCPRFRQ 
CVNTFGS Y I CKCHKG 


4835 


18736 


A 


4869 


276 


426 


VSFFLFFFFWGANPPFVPQAGGQGGFLG 
SLNPLPPGLKHFFCLTPPSSGN 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine> 
T— Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


4836 


18737 


A 


4870 


100 


485 


EERNREMTERCSVTQAGVQWHDLGSLQP 
SPPGFKRFSCLSLPSSWDYRLECDGMNL 
AHGKLRL PG S S E S P ASAS RVAGI TGMCH 
HAPLARLVSNTLPQVILPPLGLQQAEEL 
PRRQEVEHNTPSLVPI 


4837 


18738 


A 


4871 


718 


408 


EVLTLQLAHFPGFLGAHWWNQQDAALGR 
ATDSKEPPEELCPDVLYRTARTLHGQET 
YT P RL I LMDLH YD VS LGAL VL TAAG YRA 
SSCSQSVAVCTGSVLEARA 


4838 


18739 


A 


4872 


392 


246 


RGGCSELRSCHCTPAWATRVKLHLKKKE 
KRKCKI RPEGNE I LVNNAKG 


4839 


18740 


A 


4873 


1 


156 


GGGGYSEPRSCHSTPAWTTRAKLRLLKK 
KKKKKGGVC FGVGFLG P R P KGG FF 


4840 


18741 


A 


4874 


246 


2 


AHGI S I YASMLAMS I FSRRNVS FRRAGQ 
VKAVFPVSWFGPLHRHHLGEHFLFFLR 
QS FTIj VAQAGVQWCHLiGS PQPLPPA 


4841 


18742 


A 


4875 


72 


2 


CFPAAPDEDSTTNITKKQKWTVE 


4842 


18743 


A 


4876 


375 


221 


GRLRQENRLNPAGGGCGE PRSHYCTPAW 
VTEQDSISNICIYINIFSWHGQ 


4843 


18744 


A 


4877 


151 


2 


RAP F F F F FF F F VAQAGVQ WRVTjGS L Q AP 
PPGFTPFSCLSLPSSWDYRRP 


4844 


18745 


A 


4878 


401 


70 


PHRREGGCVPPPPPKNFFFSPGGYFFWG 
GGGQNPPPPKGGFFPKTPPGFFFSPPQK 
KKIFFFPPPERGPPPGFFLRPPPPFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFF 


4845 


18746 


A 


4879 


4 


152 


LPYSTPSTPFRAKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKRGGNFK 
KR 


4846 


18747 


A 


4880 


267 


1 


TPPKKKKKKI I PPPKNGPPPHI FKKTPP 
PIiFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFRTRGRTRGRTRGRT 
RGR 


4847 


18748 


A 


4881 


2 


408 


LQDATSPIIEELITFHDHAIiUIFLICF 
LVLYALFIiTLTTKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKGGGGP KKKNRGG 
P KKLGGE KKI FFFFKGGKKKTRGGVGKK 
KIFLGGGNFGPPPPKKKKPRGKI 


4848 


18749 


A 


4882 


413 


69 


LPPPPPGFKHFWAPPPPGRGVPGPPPPP 
RVNFVFLGKKGVS P FGPGGPRPLFPNPG 
GVGAGDPLDPRGQGFHGPKFMP CPPARG 
TKREPPFPKKKKKEKEEKKEPERKEKKK 
EI 


4849 


18750 


A 


4883 


46 


394 


KQYQVSVKLLFVTQNLGTKKKKKKKKKK 
KKKKKGGAL KKKPGGGQKKP GGKKKNF F 
L KRGGKKKP PG I FE KKTLFGGGENWGT P 
PQKKKPPGKKKKILKGEGGKKT IiYFGRG 
KNFS 


4850 


18751 


A 


4884 


399 


3 


FFFKKFFSPNEFWFFFPPFPLKIFFFPP 
RIiFFFWGGLAQFSPPPKKVFFSKFPRGF 
FFPPPLGKNFFFFPPGYFWPPPGFFLRA 
PPLFFFFFFFFFFFFFFFSVSIRNAIAI 
RMGTMRSRRLtAMGMLLRRG 


4851 


18752 


A 


4885 


391 


47 


PH FF WGGVF P P P P P KKKKFF FL PQ KGGG 
GGAPKKKKKKKKFLPPNFSGGFFFFSPP 
GGPPRFFFFIFFFFFFFFFFFFFFFFFF 
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K=Lysine, L=Leucine, M==Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














FFFFFFFFFFFLS KMFTF I LKYTLKQNE 
FY 


4852 


18753 


A 


4886 


32 


379 


LIjPVKKKKEKKGEKKKKKRKKKKKKKKK 
KKQKKKGGGPSIKI IGGGHLS SGGRKNN 
S FSHRGGLKKPLFFLGRT YFLGGGGLWP 
HP P P KK I AGGAKN I ILRGWGEQKPLLFP 
PGEK 


4853 


18754 


A 


4887 


178 


33 


I FKRKWFFFRFFFFFFLYYFFFKFFFF 
FFFFFFFFFFFNICHHPKLY 


4854 


18755 


A 


4888 


238 


3 


KPFNPFKKKKIGPQKKKKKFPPFLKINP 
PKYKTQNINIYKYIKIFFFFFFVRGSDS 
VAEAE VQ WHHLG S L KP AS P G F K 


4855 


18756 


A 


4889 


93 


1 


DGILLLS PRIiKCSGT I SAHCNLCPPGS S 
NSR 


4856 


18757 


A 


4890 


6 


219 


IEKAHKALGTVPGS TFVRYHHYHHHNHY 
HHHCHHHHPHQDNSEWEKAQSLEPDKLF 
LNPGFATSQLRGLR 


4857 


18758 


A 


4891 


1 


364 


TSSRSRAGRPMDFGFFFKKKGHREPPGG 
VE KKKKNWGSRKT P P S G VKNF P GFDP P K 
MWNKGGPPTPPVNFFGFKKKRGFHKGLK 
PPPPEIAPLNPQKGGEKKGGPPPPPTPF 
LGGKFQGKR 


4858 


18759 


A 


4892 


387 


81 


GVFSPPRGGPPPGGKTPPFFPRGGPPCG 
PGGRGPIFPPLSHKKFFFSPPPPLLGGG 
GGPKSAPPPKGVFKEFPENFFWGAFIKK 
SFFLKKGGFFGPPRGFF 


4S59 


18760 


A 


4893 


213 


14 


KTPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFMLE VMFLVTJRRGKI CRVPFTFFNL 
SL 


4860 


18761 


A 


4894 


384 


10 


PLPPFIPPPPPPPPLFGGGVCPYFSPPP 
HLGSPSPPPGGRPSPPPPPPRSTPPPPP 
PPFFPPPPPFVSGAPPPPPRSCIiAPPFF 
FFFPPPFFFFFFFFFFFFLPFFFIiKKSD 
FIFSLQMKYFMLI 


4861 


18762 


A 


4895 


240 ■ 


355 


Ij 1 1 T PAL F F KI F VLW LGAVAHACNP S T L 
GGQGGQIMRS 


4862 


18763 


A 


4896 


339 


3 


GVGWHLFFPPQKKFFFFFQVTGASCAAK 
SSKESHKGLHLNIYLPKIFFYRPPPPFF 
FFFFFFFFFFFLFFFFFFFFFFFFFFFF 
FFFL WREG YNQHFRGMGP IGRVGGRV 


4863 


18764 


A 


4897 


56 


335 


TCFLYKLKKKKKKKKKKKKKKKKKKKKK 
KKRGGPLKKNLGGPKKNRGGKKKIFFFK 
GGEKKNLWGFFEKKPFFGGGKICPTPPQ 
KKKPPGEKK 


4864 


18765 


A 


4898 


7 


206 


L C LQ P DNRWNDDQ S KS FL I E KKKKKKKK 
KKKKKKKKKKKKKGGGPLKKNPGGAPNN 
PGGEKKNSPP 


4865 


18766 


A 


4899 


74 


250 


INICSEKLPKKKKKKKKKKKKKKKKKKK 
KKKGKK I KRGLNS QT S LLKKRE KKKKRG 
KI 


4866 


18767 


A 


4900 


151 


312 


KYHFHKNYFFSLLYFSQNVSQLS PDGPL 
PQLPLP YINS SATRVFFGHDRRPADG 


4867 


18768 


A 


4901 


1 


364 


LNLGGRGCSEPRLRYCTPAWATEQDSAS 
QKKKKKKNPNF 


4868 


18769 


A 


4902 


1 


96 


GGGGCGELKS CHCTPAWVAEQDS VS KKK 
KKRG 


4869 


18770 


A 


4903 


3 


225 


GFHRVGQDGFDLPTSWTARLGLPRGWDW 
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nucleotide deletion, \=possib!e 
nucleotide insertion 














RREPPRPAVWAGLQLLTSSDPPASDSQR 
AGITHLSHLTLFCHLRF 


4870 


18771 


A 


4904 


384 


194 


HWVGRPFFNLWGQKNPPPPPPKRLEYQG 
WGPPPPPPFFFFKKTPIFSQVFNPPPFF 
FFFFFFG 


4871 


18772 


A 


4905 


3 


387 


TIITPILLTLFLITQLKILNTNYHLPLP 
AQYPLPQKKKKKKKKKKKKK 


4872 


18773 


A 


4906 


1 


327 


PTRP I E P I PENP KFSVPPIT PHPKKKKK 
KKKKKKKKKKKKKKKKKKKKKKGGALKK 
KTWGGPKKRGGGKKIFFFLRGGKKKPRG 
EFLKKTSFWGGENIATTPPKKKKARKKK 
KNF 


4873 


18774 


A 


4907 


134 


2 


ALY S IiKVKWPGAVAHACNPNTLGGQGEW 
IRRSGVRDQPGQHSET 


4874 


18775 


A 


4908 


59 


449 


ATRH Y T I MLAYT S VG I P I TAWLGKQA I 
KGARFQWRTTENYDKQQLIHKGGGRPP 
GCHS THTVLL P P VT WVKTTGNS TVANF V 
LjILVCLSFLLLVYRCIQQLQRDSNQREG 
AMMMVWLSKRKGGYAGK 


4875 


18776 


A 


4909 


337 


3 


LKTAWATRGNP PLYKKNTQ I S GARGEGP 
PIPLIGGGLSQKNFFTPGGENSINPDPP 
PSPPPGAKKETPPPGKKKKKKPTTKELG 
KDWTLELWDSDSQRAGKLAEVPGESSP 


4876 


18777 


A 


4910 


14 


162 


AE S GAKRET AF P F AP QQG ATD KRLN 1 1 Y 
AAKKHMKTCL P S LAIREMQ I K 


4877 


18778 


A 


4911 


331 


2 


LPIiLAPKEGQGKILGYKKPPPPGLPPFF 
GS T P PRNGEKGGP P PPRGNFW VFKKNGG 
FPRLGGRGPFPGPQKPPPRPPQKGGKQG 
GNPAAGPFFFFFE TEFHS CCPGWSA 


4878 


18779 


A 


4912 


116 


316 


ACAHAHTLFFFFFLKKKPPFVPLLGGKG 
GNLGYLKLiWL PGLKKVCLT P PRTGNYGG 
APPHRIIFCFL 


4879 


18780 


A 


4913 


93 


16 


SPPGWPGTVAHACHPSTLGGRGGRIT 


4880 


18781 


A 


4914 


316 


2 


KPRPGNPLRARVFNPPGPPGETPFFPKN 
PKFTRGGRGGPIYtfPFPGRVRPKNFLYP 
RGSRFHWPNS I PCPFAWGTKPNFFSKKK 
KKKRKKKEKLCFSASVPDAQ 


4881 


18782 


A 


4915 


3 


289 


TS CNPSTLGGRGGR I TR VGVRGQPDQHG 
ETPSLLKNYI YI YI CVDVYLYVCMYVGA 
YI YI CI CGYI YI WRYLSLTSEHTHTRAP 
GQYRVIYFFCG 


4882 


18783 


A 


4916 


1 


222 


ARGERERERERERERERESRHQGGGAGP 
PTHLWGEEGWLS S PQTGVRE KNAP I WG 
CAAPVLAPRVGRTPFGVD 


4883 


18784 


A 


4918 


1 


281 


ARGERERERERERERERERERERERERE 
RERERERERGAPPPPPRGGVRERGTPCP 
S ARGARALS PT I S CGDPHRGP KI YRGVG 
VHTQRYFSVGSSLCINTHT 


4884 


18785 


A 


4919 


1 


259 


ARGERERERERERERERERERVFVEKGD 
MVTLPAGIYHRFGGGEKKYTKAVRVCVR 
KPGWRAHNRPADHFGARGPHVKFLAQTV 
YE 


4885 


18786 


A 


4920 


797 


910 


NTMQPGWTHTCNPSTLGGQGGRITGSG 
VQDQPGQHGE 


4886 


18787 


A 


4921 


2 


67 


LEERERERERERERERERERERSS 


4887 


18788 


A 


4922 


1 


698 


TLLLAELGTICDPYRSCSISEDSGLSTA 
FT IAHELGHVFNMPHDDNNKKKKRGGPF 
KGPNFSPPVRASKNFFFGPPNLNSWARF 
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LTPGEGKTPGVPPFNPFATKAEL 


4888 


18789 


A 


4923 


323 


2 


LVCRGTIHKFRCVPHLTGRRFEHGVTDC 
YTLFRDAYHLAGIEMPDFHREDDWWRNG 
QNLYLDNLEATGLYQVPLSAAQPGDVLL 
CCFGS S VPNHAAI YCGDGELLHH 


4889 


18790 . 


A 


4924 


3 


225 


HEEALHLFQTLMECMKRKELITVFHIGS 
DEHQDIDVAILTALLKGKPLRTFLFVRP 
FILCMYMCVYVCICVCI 


4890 


18791 


A 


4925 


371 


169 


HTQS ISVVLVERRFHHSGQENPLNPGGR 
GCGELRLCHCTPAWATRAKSRLKKTKQT 
KKDAI KGSLCN 


4891 


18792 


A 


4926 


2 


352 


ARAARAGRIIKELFFFFFFGNLKKKKGL 
FWWPGGVKNPDLGJSTPPPWPPKGGGITGL 
TPRPGPMGVI FKKFGKKGSPSPTGPKSL 
GPRE FFGLAIiQRGGDLGLNHRGP PF FFG 
VLETM 


4892 


18793 


A 


4927 


2 


379 


ARANMEETQQKSNLELVRISLLLIKTGL 
EP VR I LRSMFTNNLVYDTS DSDD YHLLK 
DL QEG I Q TLMGRLENGRRRTGQ I LTQT Y 
S KFD TNS DNHDAL L KN YRL L YCFTKDMD 
KVETFLRMEQCRSV 


4893 


18794 


A 


4928 


175 


31 


ILGDLFPPAWLHPFLFLLLPFSRPSLAV 
TEVAVQWRNLGLKQPLPPGF 


4894 


18795 


A 


4929 


368 


247 


VDRLFPCCPGWSPSLELNQSACLSFLKC 
WDYRCEPLCSVS 


4895 


18796 


A 


4930 


2 


no 


ARGEPRSHHCTPAWAMSETVSKKKKSGG 
LFFLPRLV 


4896 


18797 


A 


4931 


150 


1 


KYVAPCRPLFLVLSPCRRSCFPFPFCHD 
CKFPESFSEATMFFLQPAEPRA 


4897 


18798 


A 


4932 


2 


155 


ARDDLNPGGGCSELRSRHCTPAWATERD 
SIS KKKKNPQNI WGNLKKKWNLK 


4898 


18799 


A 


4933 


144 


249 


ETLYPQPPQSLNASCVCVCVCVCVCVCV 
CLCVCV 


4899 


18800 


A 


4934 


2 


153 


ARESFVEKGFRHVGQVGLKLLASGNPPN 
LATQSAGI IS ISHCTQPWRQGL 


4900 


18801 


A 


4935 


1 


355 


GTSQE FEGRKDRLiAS PLETGNAGAGRER 
LAEVSTCPSGASKPLQTPRPGGQEGTPG 
LGGRRMRHSAGPSPNPAWES IRDPSLTS 
TALL VAC S I FH I HED P I LHD VL I S R CM S 
IKTSHR 


4901 


18802 


A 


4936 


112 


358 


NI IKFSKAFFFSFYI FIL INSVGLASTF 
MWGGELKFSFTSGILIIFQPFKIKGHPQ 
RG I LRTGTFKEDPDDPEGNKVS SLLQ 


4902 


18803 


A 


4937 


1 


258 


GTS P Y'VTQQFFSGWGRQGLTLS PKLEHS 
ELRLCHCTPS WVTERDS I S KKKKKKLF Y 
LAG P T F F P FGGGNFNLALGRLL KRGGGE 
KK 


4903 


18804 


A 


4938 


380 


1 


FFFFFSETEPYSVAQAGVQWC 


4904 


18805 


A 


4940 


255 


I 


LGNNLAGYKIRGL YS ISLRNNRHPSPVI 
SLCGNLFYVIRFFDMESCSVTQAGVLWC 
NIGSSQALTPGIAPFSFISLPSTRQNSR 
P 


4905 


18806 


A 


4941 


10 


173 


KYIEYDTNKWKHNSCSQIGRINFVKMSL 
PPKAIYTFSVFPIKIPSVYLPVHLSI 


4906 


18807 


A 


4942 


144 


3 


DKC I RRPGTVTHT CNS STLGDQGGRVMR 
SGVRDQQGQHSETPSVRV 


4907 


18808 


A 


4944 


484 


274 


PSSQRSPRTARMIALSTRPTTRSMKFCH 
VAQAGLKLLGS SDLPASAS QNVG I SGVS 
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HHTWPRFYYIjS 


4908 


18809 


A 


4945 


103 


412 


KKKKKKKKKKKKKKKKKKKKGGGPLKKN 
PGGAKKKRGRKKKIFFFKRGEKKWPGGI 
FE KKP F FGGGKNWAP P P KKNKP FGEKKK 
FLRGKGGKNPP I PPGKKKL 


4909 


18810 


A 


4946 


309 


81 


KFFFFLKGL I FLGGFCP IFPPPKKIFFF 
KIPPGVFFSPPFKKKNFFFPPPLIFGPP 
RVFFKGPPPFFFFFFFFFF 


4910 


18811 


A 


4947 


18 


264 


NYTQRFKVEYLNTSTI INQLDLTDI YRILY 
PTVEYTFFKNPHGTLCRTDHMLGHNTGL 
NKFKRTECTQSMSSDHNGIKYHSAP 


4911 


18812 


A 


4948 


418 


293 


QLLRRVRQENCFNPGGGGCSEPRSPPCP 
PTWVTKQVFFSKP 


4912 


18813 


A 


4949 


3 


252 


DSABCHLGLKHWYCGLENVKRLTDGKLE 
VPHFDSKGEVEEYFWS IGIPMTSVRVAA 
YFENFLAAWRPVKASDGDYYTLGKTK 


4913 


18814 


A 


4950 


434 


270 


KRGFPPKTPPGFKKPPPKKKKKIFPPPP 
KIGPPQKILKKPPPPFFFFFFFFFFF 


4914 


18815 


A 


4951 


399 


1 


VGFSLFPPPQKKGFSPKPPRGFLFPPPK 
GKKKIFPPPGKIGPPQGFFKRPPPLFFF 
FFFFFFFFFFFFFFGPPSVTLLFLVTIL 
LQIQTTALGRKSWGKGFRIPKPVCPVPA 
QQTKQGAKHLLGQWE INSCA 


4915 


18816 


A 


4952 


235 


56 


F TWP P P KKKKKNP P GGGGP VP P P WGGK 
NKKIFLPPKFKGPFSFGAPLPFPPGQKK 
KPL 


4916 


18817 


A 


4953 


13 


194 


S ILISLQI SLI ITFTATELI I FYI FFET 
TLIPTLAIITKKKKKKKKKKKKKKKKKG 
GGAF 


4917 


18818 


A 


4954 


177 


1 


PPQKKKKNIFSPPGKIGPPQGFFKRGPP 
LFFFFFFFFFFFFFFFFRFEGECSDLDG 
RV 


4918 


18819 


A 


4955 


450 


0 


LPNKADKKKKKKKKKKKKKKKKKKAR 


4919 


18820 


A 


4956 


29 


158 


AMIVPLYSSLGDFFKKKEKKKKKKKKKK 
KKKKKKKKKKGGGGF 


4920 


18821 


A 


4957 


397 


0 


PSSPPPPSPPPSPPPPPPPPPPPPPPPS 
SPPPAPPPPPPPSPPP 


4921 


18822 


A 


4958 


257 


397 


FFFLSFTFVAQAGXQGXXYGSLDPPPPG 
FNIiFSCPSLPRIWNYTAPP 


4922 


18823 


A 


4959 


419 


266 


RFHHDGQDGLELLTSSDLPASASQSAGI 
IGVSHRAQSIKSLAPQYWYFPSF 


4923 


18824 


A 


4960 


300 


190 


FFFFFFFFFIVAFFFFFLFCFLFFFFFF 
FFXIFNNIF 


4924 


18825 


A 


4961 


3 


179 


DAWVLPF 1 I AALAALHLLFLHETGSNNP 
LGITSHSDKITFHPYYTIKDALGLLLFL 
LS 


4925 


18826 


A 


4962 


2 


182 


RVNAKDS KWLTPLHRAVAS CSEEAVQVL 
LKHSADVNARDKNWQTPLHIAAANKAVK 

CAKLt 


4926 


18827 


A 


4963 


273 


440 


KIHIQHCWWEYKLLQSLWKAAWHFLKEL 
KAELPFNPAIPLLDIYPEEYKSVYLKDT 


4927 


18828 


A 


4964 


179 


3 


SLPFCCNLTPKKKLFFKFCRKNGFFFFK 
ME S HS VTQ AGVQWCNLNS LQPCPHGLKQ 
FS 


4928 


18829 


A 


4965 


268 


364 


T VC VI E LLT S GDRP AS AS Q S AR I TGL S H 
HAQP 


4929 


18830 


A 


4966 


374 


2 


QNFFLKKKKKKRVGPGVPPFNPPPFWGQ 
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GGGFPRSILGNPIKPLFFLNSPKISPPL 
WPGPFFPLPWGVWAGKNPLPRGGGFPLT 
KFPPLPSSLGKKKKFRFKKKKKKKGQAR 
WLTPAI PTPRE A 


4930 


18831 


A 


4967 


79 


346 


DKIFSNISCGIVIDLLAIYNWDAYATRL 
GIYKHWDFIIIDKDTSSCRLSFSSYPRF 
IjES LDD FYILSSGLI llqttns vfnktl 
LKQVI 


4931 


18832 


A 


4968 


3 


93 


FLHVGQAGLGLPTS GDP PASAS QSVGLQ 
A 


4932 


18833 


A 


4969 


158 


1 


SKSNFAYIFPIKDNGBTGKKTINRWTK 
NWANDLNRHFSKWIQMAKRYMKK 


4933 


18834 


A 


4970 


2 


266 


VGQ AGVE LLS S S D P PAS AYE GAG I P GVS 
HHTWPKHFFPALFVACISSLWCLFQLF 
AC I S I GlaS F F FF F FREGDLGNLLKI AGL 
TFLG 


4934 


18835 


A 


4971 


3 


96 


GVRHHAHLI FVFLVETGFHHVGQDGLNF 
L 


4935 


18836 


A 


4972 


253 


348 


NF I FLLF I YFEMESHS VAE AGLQWHCLS 
SLQP 


4936 


18837 


A 


4973 


326 


2 


PMEIPQHKLSLWPGFAISVSYFERKLLF 
SADVSYKVLRNETVLEFMTALCQRTGLS 
CFTQT CE KQL I GL I VLTRYNNRT YS I DD 
IDWSVKPTHTFLKRDGTEITYVDY 


4937 


18838 


A 


4974 


3 


244 


DAS LVFKVAETANEEE VKKMCM YKYPGN 
KCGR EGGRE KKRE E ERREGR KEGRE GLR 
EGETDEEREEGEEGLSYSPFKNSI 


4938 


18839 


A 


4975 


6 


79 


THFSLTI TSLQPEDIGTYYCQQYD 


4939 


18840 


A 


4976 


15 


339 


PGE AGNCLNPGGGS CSEPRSRHCTPAWA 
TERNS VSKKKKKKKKKKKI PRGRGLPPV 
S HP F W KAGGAD W FD LGTLE P P W P TGE T P 
VFKKKKKLTGGGGAPLVAPTWGGG 


4940 


18841 


A 


4977 


204 


275 


I IEVDPDTKEMLKLLDFGSLSNLQ 


4941 


18842 


A 


4978 


3 


352 


RRPS PHGLVGAVS VGGAGVMAVETLS PD 
WEFDRVDDGSQKIHAE VQLKN YGKFLE E 
YTSQIiRRIEDALDDSIGDAWDSNLEPMA 
WRRLPYEQSSVLELIKTENKVLNRWTG 
YAGL 


4942 


18843 


A 


4979 


219 


3 


AASTMAMSFEWPWQYRFPPFFTLQPNVD 
TRQKQLAAWCSLVLSFSRLHKQSSMTLM 
KSRLTSLS ITVMLDC 


4943 


18844 


A 


4980 


142 


3 


FQGFFFLRDKVLLCCPGWSRTPELKRSS 
CLGLPKCWDYRRELPRLA 


4944 


18845 


A 


4981 


351 


2 


FFFFFSETESRSVAQAGVQWRDLD 


4945 


18846 


A 


4982 


133 


3 


SFFFFLFETECCSVPQAGVQWCDLHPLQ 
PLPPGFRRFSCIiSLP 


4946 


18847 


A 


4983 


3 


161 


LGSLHDTANTLWPRLERSRTIMAHYSLD 
LPGSSDPPTSASHVRGTTGMRSTRP 


4947 


18848 


A 


4984 


363 


3 


GPSNKKRGRSKKAHVLAASVEQATENFL 
E KGDK I AKE S Q FL KE E LVAAVE D VRKQG 
DLMKAAAGEFADDP CS SVKRGNMVRAAR 
ALLSAVTRLIilLADMADVYKLLVQLKW 
EDGILKL 


4948 


18849 


A 


4985 


101 


1 


LQVC YGRR WMNS RE YGAW KQQ VE S KNM 
PFQDA 


4949 


18850 


A 


4986 


341 


1 


P P PM S P PD VXjKT S p vadaagw vd vdke t 
LQHQRYPNVFGIGDCTNLHTS KTAAAVA 
AQSG ILDRP IS VIMKNQTP S KKYDGYTS 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CPLVTGYNRVILAEFDYKAEPLETFPFD 
Q 


4950 


18851 


A 


4987 


86 


1 


KCSGT I S AHCNVRLLGS SNSGP SASRAI 


4951 


18852 


A 


4988 


124 


3 


LPWLCSKFETKTFQFQIYYKAKGVKTVW 
YSHKDRNIDQWN 


4952 


18853 


A 


4989 


305 


150 


WLLNYRYSHSAEVQVCFQFLTWVFSTLG 
S PDHFS KFFNQYFFKI CFD Y WKN 


4953 


18854 


A 


4990 


64 


340 


KKKKKKNLLKFPGQREKPLSPNFLGGLG 
GGKNGPPGGKPSRDPDWPPGIKPGEKKK 
NP LKKKKKKKKKP L S KGGGG KKT P LKKK 
KKPPPKGF 


4954 


18855 


A 


4991 


3 


158 


P S LVQTRLREAGQAGLKLIiTS TDS PAS A 
SQSAGTTGVSHHAMPKDCSLNSN 


4955 


18856 


A 


4992 


116 


66 


FFFFFFFFFFFFFFFXE 


4956 


18857 


A 


4993 


3 


107 


GQAQ WLTPVIAALWEAKVGGS FE VRSLR 
PAWPTQ 


4957 


18858 


A 


4994 


1 


105 


KLDRLARHGLYE KKKTSRKQRKERKNRM 
KKVRGTW 


4958 


18859 


A 


4995 


110 


3 


RRIGRYKTVFLCTQLEILMARYPLPPSP 
KPIKIKN 


4959 


18860 


A 


4996 


109 


3 


DEVSILCPRLECNGTILAHCNRRIiPGSS 
DSPALAP 


4960 


18861 


A 


4997 


1 


148 


ACCPFCTIYLLPMFMIIKAPLMGTSKKK 

KKKKKKKKKKKKKKKKKXKKKKKKKKKK 
K 


4961 


18862 


A 


4998 


3 


110 


VI LQGSNDVELVAEGNSRFT YTVLVDGC 
SKKTSTRP 


4962 


18863 


A 


4999 


3 


330 


P IADRGAEYVSAREWMTICFELLQLLKA 
HKKAI RRATVNTFGY I AQAI GPHDVLAT 
LDNNL KGHE RQNRVCTT VA I AI GAE TC S 
PFTALPALMNEYRVPELNVQNGVLK 


4963 


18864 


A 


5000 


3 


110 


VI LQGSNDVELVAEGNSRFTYTVLVDGC 
SKKTSTRP 


4964 


18865 


A 


5001 


25 


329 


NSRRRRNDQGSPNLCQTFTFMTLPYLPE 
HRS LLLKI RS CAERETKKKDD I PEEDKG 
NI KQCE IN YVKKFQ S FQDfiKL K I S KEE S 
KILKKAQKDGFLHETLLN 


4965 


18S66 


A 


5002 


278 


345 


EDEEGYNDGEVDDEEDEEELGG 


4966 


18867 


A 


5003 


101 


3 


VR 1 1 I SGTGKKKKKKGKLPKN YDPKVTP 
DPER 


4967 


18868 


A 


5004 


18 


350 


VSHECLIFHIQNVLiTGLVISLSGPSVPS 
HTHKHTHIHTHTHTHTHTHTFS 


4968 


18869 


A 


5005 


26 


147 


KE E VRMAL FANDM I V YLEN P F VS AP YVL 
KLISCFCKGRLG 


4969 


18870 


A 


5006 


85 


1 


SASRVAGI I GKHHHTRIi I FVF L VE TGQ S 


4970 


18871 


A 


5007 


370 


3 


RARKPLLWVLKKIWSPWIYKKKTRINFQ 
KEKKSPFFSRPFFLFKKGGFLPPGVFFS 
RGFLKKIPPFFFFFFETESRSVTQFGVW 
SVQWCNLGLLQPEPPGSKQLLCRGIiPSS 
WDYRCLS TRP 


4971 


18872 


A 


5008 


1 


85 


IRILSKIKNALTHFLPQGTPTPLIPILF 


4972 


18873 


A 


5009 


218 


93 


FFFFFFFFYFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


4973 


18874 


A 


5010 


3 


250 


RPRRRRLRQENHLNLGGEGCSESRSRHC 
TLTWATEQDSVSKKKKKRVFPPPHPRGK 
FFWAGGPTFFPPLKTVPFPGGGKPNF 


4974 


18875 


A 


5011 


336 


0 


SSSSPPSSSSSPPPPPPPPPPPPPPP 
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C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F— Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
KHLysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


4975 


18876 


A 


5012 


2 


80 


SNQNGTGGES I YGEKFEDENFHYKVK 


4976 


18877 


A 


5013 


53 


324 


FLVFFVPENSEKISLQLHLALTSNSSWV 
QSPSHLELMNLCRHINIRVDPTGLREGM 
HYTEVCGYDIASPNARSLFRGPITAVIA 
AQVNES 


4977 


18878 


A 


5014 


329 


1 


FF F F F S E TE S R S VAQAGVQ WRYLGS LQA 
PPPGFM 


4978 


18879 


A 


5015 


193 


32 


PLLILCSERLCHCTPAWAIERDSVSKTN 
KTKQQQQQQQNQGTKCMFLALLGQI 


4979 


18880 


A 


5016 


354 


235 


CGE IGMLLHCWWECKLVRPLWTNLFLRD 
GS TL T VGS KSSF 


4980 


18881 


A 


5017 


348 


2 


RLLVGKLMELHGEGSSSGKATGDE 


4981 


18882 


A 


5018 


76 


1 


IFFLSQVSPLSKEDAGEYECHASNS 


4982 


18883 


A 


5019 


68 


356 


YFGGVGGFFFFFFFFFWFGPSGGVFFVG 
VQAIFFFFFFFLGGFFFLVRDYFFVAVF 
WWGLLFFFVFFFWFCFVFFFFFGWKRK 
NIFYGWVFLFCW 


4983 


18884 


A 


5020 


1 


288 


FFLNLEKKFI KMLSS YTDNG I WATAED F 
MQNFKNLVGYHNS ITEENLPQLGANENL 
ESQSGNFSWFIFFNADRKRGMVLLLPN 
NEMTILKPRTSV 


4984 


18885 


A 


5021 


133 


1 


WATETMWSTKPKMWPGTGAHTCNPSTL 
GGQGGQITSPQKFKTS 


4985 


18886 


A 


5022 


2 


337 


RRS D PJNF F KNRIiRE RRKKQ KLAKE RAGL S 
KLPDLKDAEAAQKFFLEKIQLGEKLLAQ 
GEYEKGVDHLTNAXAVCGQPQQILHVLQ 
QTLPPPVFQMLLTKLPTISQRIVSAHSL 


4986 


18887 


A 


5023 


1 


325 


VDGCPANLIiS SURS LVLRAET I S LGEHP 
CDRGEQVTLFLFNDCLE IARKRRKVIGT 
FRSPHGQTRPPASLKHIHLMPIiSQIKKA 
LDI IETEDCHNAFALLVRPPTEQA 


4987 


18888 


A 


5024 


135 


2 


ATMFLNSKVSKYSGYLLGFHECREKGWM 
TWDGERDPSPGILQLQ 


4988 


18889 


A 


5025 


2 


94 


KTATKLiIGGXHYDSXNIKAIRDGLLARR 
HAL 


4989 


18890 


A 


5026 


2 


362 


QELERSMAQRCVCVLALVAMLLLVFPTV 
SRSMGPRSGEHQRASRIPSQFSKEERVA 
MKEALKVFPTWSTSFIQHEWEEYSHL 
FTIQGSDPSLQPYLLMAHFDWPAPEEG 
WEVPPFSG 


4990 


18891 


A 


5027 


222 


84 


AASTFCHVSQAGLELLTSDDPPASASQS 
AGITGMSHRTQPMVHIiY 


4991 


18892 


A ■ 


5028 


366 


0 


SGRS KKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKRGGR 


4992 


18893 


A 


5029 


231 


68 


AGMGSRALPKPLLMSHSSLKAVELPDS F 
SPELRSLIiEGLLQRDVNRRLGCLGRG 


4993 


18894 


A 


5030 


2 


366 


LNLGGRGCSELRSCHCTPAWVTETPFKK 
KKKVPHGGYFVAQLLPPRKMRTHRRPQA 
QGERTGPGWGAPVHTGLLPRRGQAFTQP 
PSAFKGFFHPTNKSSSEAKKKKKKPRVG 
RKKNVGYIL 


4994 


18895 


A 


5031 


241 


368 


QVERNFKS QSGAEAHI CDLS TLGGCGGQ 
ITRSGVHDQPCQHG 


4995 


18896 


A 


5032 


279 


396 


DGGMW P GT VAHACN P GTLGGQ GRW I TRS 
GVREQPGQHGE 


4996 


18897 


A 


5033 


173 


398 


SNESSIiRVNPFFFFFFLKTNFSFCPPGG 
RQGPNFSLLDPPPPKVKEI FCLTPPKRW 
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E YRPAP P P PRNFG F L I KTG 


4997 


18898 


A 


5034 


2 


143 


SNPPALTSQSMGITSMTRHALPIACFLV 
FVFLFFEMESHSLAQAEVQ 


4998 


18899 


A 


5035 


350 


2 


QEISSVQTSTQLFNGMTVKARATTREVM 
ATYTIEDIVIELIIQLPSNYPLGSIIVE 
SGKRVGVAVRLFIFITQKSFIFLFSFLT 
LCLCLQHFHNDFLLL I VP I L I AMAFLML 
TERK 


4999 


18900 


A 


5036 


1 


100 


HECGSSSQRTLSVQEAAAYLKVSNEIRI 
LIAIF 


5000 


18901 


A 


5037 


247 


334 


LQ I D I S AWAYT I AVKEDGELNLMKKAA 
S 


5001 


18902 


A 


5038 


2 


296 


DKAPMLKVI VNSLKNMINTFVP SGKVMQ 
WDEKLPGLLGNFPGPFEEEMKGIAAVT 
DIPLGKVHLEALKKKVIKFFYKFPLRCD 
IHTAKCTGLRYTAQ 


5002 


18903 


A 


5039 


152 


335 


RPIWSYYLLFFFLFWGLYLDWKRIFTF 
I LSEKYFDMKKNQCKEGLDI YKKFLTRM 
TRISE 


5003 


18904 


A 


5040 


336 


1 


GGLT I S SLLKEKEGSEVAKFTLEQLCL I 
CNIMSTAEYCVAATQQLEEKLKEKVDVS 
L FDR I NL S GDMGT F S P V I S S S I HLL VQD 
VDAACDPAM VAMS KMQ CQNVQHVG VKS S 


5004 


18905 


A 


5041 


3 


204 


LNLGGGGCSEPRSCHCTPSWVTARLHLK 
KKKKKKRGKNPLKNGGKENFKFLQILW 
PKNSLENLAV 


5005 


18906 


A 


5042 


382 


148 


W CNHRGPRSRKKRS EGSTKSRRLGAT I R 
MVTPHTTRTCARDPELTSKEKCVYIEEH 
THTHTH I YI HTHTHTC I YVKTH 


5006 


18907 


A 


5043 


137 


1 


RPRRRYMKKFSTSLI IKKMKIKITMKYH 
LSHL I PVKMAITKKTKD 


5007 


18908 


A 


5044 


198 


34 


KPASRFCHVGQADLELLTSSDLPASASQ 
SAG I TGVSHCSQ PNF I TLCLWTDHFF 


5008 


18909 


A 


5045 


11 


357 


LLTYFIIMFKILEIYEKEEQLIIKQISE 
GQEKVKELRQFKEHRKAKDSALQS i es k 
MLELTNRLRESQEEIQIMRKKKRKNAAR 
GPYWKFKYSQRWKSPPGPFQWGWGPRKN 
FFV 


5009 


18910 


A 


5046 


2 


193 


GLQLLTSSDPPASASQSAGIAGMSHCTQ 
PQVHLMPSLYHFRFLQVDTKDLLRASAD 
LIHRGIT 


5010 


18911 


A 


5047 


1 


196 


KLCLVMNSPMKAAPRNFSCAPSLSLLPF 
SFRKTTTTLTSNTADDFAYFDLCKMEPC 
SMVLFFVFC 


5011 


18912 


A 


5048 


386 


276 


AQAVLELLDSSDLPPSASQSAGISGVSH 
CTQPDSIF 


5012 


18913 


A 


5049 


1 


324 


VDAAAE KLEASTGWLMRFKERS CLHNI K 
MHGEATVADTEAAAGYPEDLAKITDKGV 
YTQQQ I LNGDE I AFC WQKI P CR I FLARE 
QAVPGCNAS KARLT VLLWANAAGD 


5013 


18914 


A 


5050 


1 


129 


PRSCHCFPAWVTEQDSVSQKKKKKKKRE 
KEKKKNPVEKSAKVI 


5014 


18915 


A 


5051 


193 


356 


RSFIPSASASASTLCLKCSQNGQPGAVA 
HACNPSTLGGQSRQLTRSGVQDQSGQY 


5015 


18916 


A 


5052 


192 


341 


AKRVKRKKTFFFFFLKQFCSVPQAGGRGP 
DPGSLKPLPPGLKGFSCPTPLN 


5016 


18917 


A 


5053 


340 


1 


KR I PDKPQKELRRLATKL I REAPEKGQA 
QGKEIHKSIQEAKGEXFKAIDRIKKSQF 
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nucleotide deletion, \=possible 
nucleotide insertion 














DRQEALDI LLVMQNALESLSNR I EQVEE 
RNSELEDKVFDLTRSSKAKGKRIRNYDH 
S 


5017 


18918 


A 


5054 


2 


383 


WKVATQPADNPLDVLSRKLHLGPNVGR 
DVPRLSLPGKLVFPSSTGSHFFMLGIGD 
IVMPGLLLCFVLRYDNYKKQASGDSCGA 
PGPANI SGRMQKVS YFHCTLIGYFVGLL 
TATVASRIHRAAQPC 


5018 


18919 


A 


5055 


2 


383 


GSVhS KKGDYLKYHYNASLLDGTLLDST 
WNXjGKTYNIVLGSGQWLGMDMGLKEMC 
VGEKRTVI I PPHLGYGEAGVDGEVPGSA 
VLMFDIELLELVAGLPEGYMFIWNGEVS 
PNLFEEIDKDGNGEV 


5019 


18920 


A 


5056 


2 


383 


AVIDEVRTGT YRQLFHPEQL I TGKEDAA 
NNYARGHYT IGKEN IDLVLDR I RKLAD Q 
CTGLQGFL VFHS FGGGTGS GFTSLLMER 
LiSVDYGKKSKLEFS IYPAPQVFTAWEP 
YNFILTTHTTLEHSD 


5020 


18921 


A 


5057 


26 


452 


KLiMSLRLRQAWHEAAIDEFRTGTYRQLF 
HPEQLI TGQEDAANNYARGH YT I GKEN I 
DLGVDRIRKLADQSTGLQGFLVFHSFGG 
GTGSGFTSLLMERLSVDYGKKSKLEFFI 
YPAPHVFTAWE P YNF I LTTHTTL EHS D 
CA 


5021 


18922 


A 


5058 


2 


385 


AVIDEVRTGT YRQL FHPEQL I TGKEDAA 
NNYARGHYTIGKE I IDLVLDRI PKLADQ 
CTGLQGFLVFHSFGGGTGSGFTSLLMER 
LSVD YGKKS KLE FY I YPEPHVCTAWE P 
YNFILTTHTNLEHSDC 


5022 


18923 


A 


5059 


3 


390 


GDAANNYARGHYT I GKE 1 1 DLVLDRIRK 
LADQCTGLQGFLVFHSFGGGTGSGFTSL 
LMERL S VD YGKKS KLE FS I Y PAPQ VS TA 
WEPYlsrSILTTHTTLEHSDCAFMVDNEA 
I YD I CRRNLD I ERPTYT 


5023 


18924 


A 


5060 


103 


3 


KIFFFLRWSFALITQAGVQWRGLGSLQP 
LPRAT 


5024 


18925 


A 


5061 


262 


412 


KASPFRTGTAFGNGKTSDYLLLGNFGYT 
FGGITGCLKAGLETSYWTWFTH 


5025 


18926 


A 


5062 


293 


406 


VUGS I FE VIWAVI KPGTSFGI S VLRAL 
RIiLRI FKVTK 


5026 


18927 


A 


5063 


419 


15 


WEEEGPLPKKKKGGFSNKRGNI IMGPFX. 
KRPPEKKPPPPPRKINPFFFFFKRKGPP 
PPRVKGKGAIKGSRNPPLFGSRDFFSPA 
PQKSGAPGGPPPPPVNKNQFFFFFFLVE 
TGFHHVTQAGPELLSSSSPPTI 


5027 


18928 


A 


5064 


2 


216 


GliTNLFIFCRDRILLCCPGWSQTPSLKQ 
SSHLSLPKPWDYMYEPLCLYFLYPWFCL 
SLSLS FPLSHT YFFG 


5028 


18929 


A 


5065 


259 


36 


S Q PHWD YRASRAE E IATFLEVWLQAIFV 
CCLFLRQSPFIAQAGVQWHYLKSLQPPP 
PGLKQFSCFSLLLVPRLA 


5029 


18930 


A 


5066 


2 


404 


GKGAPTTSLISVAVTKIIAKVLEDNKLP 
GAI CSLTCGGANIGTAMAKDERVNLIjSF 
TGSTQVGKQVGLMVQERFGRSLLELGGN 
NAI IAFEDADLS L I VPS ALFAAVGTAGQ 
RCTTARRL.F IHES IHDEWNRL 


5030 


18931 


A 


5067 


1 


400 


GENM I TGT S QAD CAVLNVAAGVGE FE AG 
I S KNGRTREHALLAYTLGVKQLIDGVNK 
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MDSTEPPYSQKRYEEIVKEVSTYIKKIG 
YD PDT IAFVP I SGWNGDNMLE PRANI PL 
VTGWKATPIDCDASGTTLLDA 


5031 


18932 


A 


5068 


41 


392 


GSPHHPCAHIERKKKPYNSltflGFYTKRN 
ALRVAE VWMDD YKSHVY I AWNLPLENPG 
IDIGDVSERRALRKRLKCKNFQWYLDHV 
YPEMRRYNNTVAYGELRNNKAKDVCLDQ 
GPLEN 


5032 


18933 


A 


5069 


3 


395 


GTPTRPHILLQALFIRAMLPCPDYDTDT 
KTGLDQAIiKI CQAMLDEAANQGWLVT VL 
NNTWLIQMA1QGRWVXDSSLLTLPYIEN 
HHLHLFKKRKPIMKGPHAKSRTSIECLP 
ELIHACGGKDHVFRSLVQQ 


5033 


18934 


A 


5070 


3 


405 


PRASEVCGFSCHITCVNKAPTTCPVPPE 
QTKGPLGIDPQKGIGTAYEGHVRIPKPA 
GVKKGWQRALAIVCDFKLFLYDIAEGKA 
SQPSWISQVIDMRDEEFSVSSV1ASDV 
THASRKDIPCIFRVTASQLSAS 


5034 


18935 


A 


5071 


3 


393 


ITRQEFIDGIIiASKTFPTTTIEMTVGADI 
FDREGDGYIDYYEFGAALHPNKDAYRPT 
SDAPKTFHQGTRQVAQC I CAKRFLVEH I 
GENKYRFFLCNHFGDSHQMRLVRILLST 
VMVLDGGGWMALDQFLTT 


5035 


18936 


A 


5072 


3 


394 


I TREE FI DG I LAS KFPTTK I KMT VGAD I 
FDRDGDGYIDYYEFVAALHPNKDAYRPT 
SDAYKIEDEVTRQVAQCKCAKRLLAEHI 
GENKYRFFLGNQFGDSHQLRLVRILRST 
VMVP VGGGWMALDE FLVND 


5036 


18937 


A 


5073 


1 


393 


GEDAANNYSRGH YT IGKE I IDLGLDR I C 
KLAD Q CTGLQG FL VFHR FGGGTG S G VTS 
IjLMEHLSPDYGPKSKLEFS I YPAPQVFT 
AVHE P YNSMLTTHTTLEHS D CAFMGDNE 
AIYDICRTNLDIERTTYTN 


5037 


18938 


A 


5074 


39 


482 


LGLHSAWRDDKIGYNPDTVS FVP ISGWN 
GDNMLE P S ANMPWFKGWKVTRNDGNAS G 
TTLLEALDCILPPTRATDKPLRLPLQDV 
YK I GG I GT VP VGR VE TG VLKP GMVATFA 
PVNDTTEVKSVEMHHEALSEALPGDNVG 
FNVKNVSV 


5038 


18939 


A 


5075 


1 


390 


GVSMAVSIiVIIYQYWRNMPDPHNLPIV 
AGW KKYP L FFGTAVFAFEG I RWLPLEN 
QMKE S KRFPQALN I GMGIGTTLYVTLAT 
IiGYMCFHDEIKGSITLNLPQDVWVYQSV 
KIL YS FGI FVTYS IQFYV 


5039 


18940 


A 


5076 


330 


462 


VNFFFREGGTESPS IAQARVPWCSLNSL 
QPLPPGFKRFSCLSLP 


5040 


18941 


A 


5077 


1 


403 


GGWGVEAEDFE YAPDVEPLEPTLSNI IE 
QRSLKW I FLGGKGGVGKTTCS CSLAVQL 
S KGRE S VXi 1 1 FTD P AfflSFI S HAFD Q KVS K 
VP T KVKG YDNL FAME I DP S LGVAE LPD K 
FFEEDNMLSMGKKMMQEAMSAF 


5041 


18942 


A 


5078 


348 


1 


RPQGNNI ERWQGGE Vlil IQLLR IAKSVK 
NVQLLWKTQFLIKI1HIL1LFYDPRIPFVS 
NYPKEKKTYVHKKTYAQIFISLPRTFLF 
LRQSLNMLPRQDLS S 


5042 


18943 


A 


5079 


248 


3 


PGQVLEREHTCRRPWAWNVCICLAGPA 
KPNREDVSS SQEESLQLNS I PPTPTLTS 
TAVKSRQPLWGLKEMEEEDGSELDF 
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Amino acid sequence (A~AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine> M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=VaIine, 
W=Tryptophan, Y=Ty rosin e, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5043 


18944 


A 


5080 


3 


192 


GDGSELRSCHCTPAWLTERDTSSQKKKK 
KEKKTWNPAPELFFFLLLEKKGQNQGKP 
NMRKVL 


5044 


18945 


A 


5081 


10 


171 


CFCLGPVGVCRSLLKPQLCFLGFPWLQD 
ARQ KFRS VL VE AT VKLDEL VTRS TRP 


5045 


18946 


A 


5082 


94 


1 


DFWPGKVAHACNPSTLGGRGGWITRSGV 
QDT 


5046 


18947 


A 


5083 


142 


3 


AAS TGS HS VAQAG VQ WKDL S SLQPLPPG 
FKRFSFLSLPSSWGRKIA 


5047 


18948 


A 


5084 


257 


177 


I S LGE VAHACNP S TLGGRGGQ I TRPG 


5048 


18949 


A 


5085 


143 


1 


GGESHSDTQAGVQWCYLGSTQPPLTLTF 
KQFSCLNLPSSWNYRCVPP 


5049 


18950 


A 


5086 


3 


87 


H VGQAGL VL LAS CDP P S LAS QSDGITV 


5050 


18951 


A 


5087 


375 


220 


LHSLSQHIATRSKKPAFNLFAFGTLSPV 
STNWVGFNPISFEGLMLIIFFFFF 


5051 


18952 


A 


5088 


40 


203 


I FLP LLL VS F F FY I LT WGFTF FWGLTL I 
NDLLNFFFGNSGI S FS WFGSRAGELG 


5052 


18953 


A 


5089 


1 


145 


MDMRVPAQLLGLLLLWLSGARCD I QMTQ 
S P S S LS AS VGDREDGD VDAA 


5053 


18954 


A 


5090 


146 


328 


FMPFPLLESLEIFSSKLFNPPNLFFFFE 
TE S C S VAQD GVQ W YDLGS LQ P P S PGFKR 
LSCLS 


5054 


18955 


A 


5091 


338 


1 


PNLPSVQLPPTTSCTPLIGDLLVARAHF 
AGWKHQGINS PSL PMAPREGME APRNP F 
HPHI FLCLI HGFHLAVS KLYSL FYYYYY 
YFSRWSFTLVAQAGVPWRDLGSLQPLPP 


5055 


18956 


A 


5092 


2 


327 


PPLTPPIFPSSPKNKNPPPPHYNPPPPP 
FRTPPPPPYPPLSPPKSPPPPPRVDPPL 
PYS I FPRPNLIS PPPYS PFYLLPQLQAF 
P 


5056 


18957 


A 


5093 


133 


3 


AQTCTPSTQINSKWITDLNVKGKTIKLL 
KDNIREKLDDLGCGE 


5057 


18958 


A 


5094 


170 


350 


AGGQGGNFYSLQPNPPWIRESSHSTLPK 
FWNYRHTPTGPPNFGFFFLKMGFGLVAQ 
TXFN 


5058 


18959 


A 


5095 


133 


274 


RD I I I FLESGI KGYFLFFSDKI TSELVS 
KIGDKNWKIRKEGLDEVAG 


5059 


18960 


A 


5096 


2 


188 


RE MQ I KT T VR YHLT TARMVL I KKS EKNR 
CWHGCSEKGTLLHCWWECKLLQPLWKIG 
WRVDAA 


5060 


18961 


A 


5097 


235 


81 


FPKKKKAQIKKPQAAQSLNLSLSLYIYA 
FT YTHTHRHTHTQ THTHT Y I H I S 


5061 


18962 


A 


5098 


355 


169 


KILLLITDSCAHISCRFSECMKYFNFFF 
HPHSVYVTDDKIRLLEEQLPHVFSNKME 
PFKVCN 


5062 


18963 


A 


5099 


160 


1 


FSAS Y FNS KKKL I F LLLKLRWNF F VC F F 
GME S HS VAQ AGVQ W CHLGS LQ PL P S 


5063 


18964 


A 


5100 


364 


151 


QMGFCHIGQADLELLTSSDPSASASQSA 
GI TGVSHYASQE FLKKEFHS AHL I PLQG 
TCVQGKTAQPYSEAL 


5064 


18965 


A 


5101 


205 


3 


AASPGVQWPDSRLLQPRPFGVKQFFPLS 
FPSSWGYRGAAPPPGHFFFFFPSRDEIS 
VCLPGWSPTPE 


5065 


18966 


A 


5102 


349 


207 


DGASLCCPDWGLELLSSSGLPALALQNP 
EITGVGHHTQPWTLSLLIY 


5066 


18967 


A 


5103 


425 


0 


CEFFKNKDFCFhSKKKKKKKKKKKKKKK 
KKKQSSSQTSYS 
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Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenyla!anine, 
G=GIycine, H=Histidine, I=Isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y-Tyrosine, 
X=XJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5067 


18968 


A 


5104 


437 


184 


LFLFFFPPPKKVFSQKTPRSVYSPPPKK 
KKKNS P P PLKLGP PQE I FKTP PPPIFFF 
FFFMFFFSFFFFFKGHRPRFISLWSPPL 


5068 


18969 


A 


5105 


401 


10 


MFSPPFGGPRGGVPLRPGFKNPPGPKGK 
T P F FL KNQN YLG P PLY S P F LERLNKKKA 
FTLEAKGPNNPNSSRSPPPGAQKKNPPF 
KKKKKKKRKI I FMPQAKPAVYWCSGNQ 
LPITFPYSIISGPLSSFT 


5069 


18970 


A 


5106 


413 


2 


PNKKPSPPPFSPPHKTFSSPSPPPPPNK 
PTPTPPPPNKKNFPPPQPPPKKKNPPQT 
PPPQKKTPPPPKKKKHPPPPHKKTPPPP 
HTKNPPPPPPLPPSLSIPPLFPLYINSP 
FFIPFFFFFFFKIFFYLGRVGGRV 


5070 


18971 


A 


5107 


269 


115 


GFFFFFFFSFFFFFFFFFFFFFFFFFFF 
FFFFFFIKKCHFIIFKYIGSTSN" 


5071 


18972 


A 


5108 


238 


4 


QWLPLVDRIWVILLFFDSLNLKMFPQYQ 
IGLQKIKVKNNTTFYFLFSFFLRWSFTL 
VTQAGVQWCDLGPLQPPPPRFK 


5072 


18973 


A 


5109 


496 


0 


STPSRASPKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKSSK 


5073 


18974 


A 


5110 


1 


243 


DHLSLGGRGCSEQRSCHCTPAWVTERDS 
VS KKKKS FRALD I F FFNGKKI LGTLRGK 
NQKPPREVSSGNYFHFKGLFGPCYK 


5074 


18975 


A 


5111 


78 


4 


LGMVAHACNPSTLGGRGGRITRSGV 


5075 


18976 


A 


5112 


2 


96 


AGVQWRDLGSLQPPPPGFEQFSCLSTMP 
NFL 


5076 


18977 


A 


5113 


128 


6 


IYKDVLEPGVLRLLDVDNRVVLPIEAPI 
RIIIPSQDVLHS 


5077 


18978 


A 


5114 


132 


2 


DDLSSLQPLPPRFKRFCCLSLLDSWDYR 
RPPLRPAIQEAEAAE 


5078 


18979 


A 


5115 


1 


146 


AVE TGF CHVAKAALE LLH S ND P LAS AS Q 
SAGITGMSHRAWPSLFSILS 


5079 


18980 


A 


5116 


213 


3 


GAVGFSSGCFLLYLYALLGPFPPLPLLQ 
LKFICWPGAVAHVCNPSTLGGRGGRITR 
SGVRDQPDQHDET 


5080 


18981 


A 


5117 


407 


0 


LGWAWWFTPVIPTLREVKAGGSPEVRS 


5081 


18982 


A 


5118 


215 


414 


K I RP KE I RNN GGS QQRKRAQ LG KLE Q TN 
P VLNFKAPFFGEQGD Y YKS FFKTCLDNL 
PRRGKGVFFF 


5082 


18983 


A 


5119 


40 


326 


KKKKKKKKKKKKFKKTGGAKWAP FLNLK 
KKG I KPQKKKGFFQGKKKRGICKKKI PKI 
LIQGFFQKKKNFSKKKISQKKKKNGFWP 
LGKGFCPKKKT 


5083 


18984 


A 


5120 


99 


2 


GRSTRHS PAHTHTHTLRHTGTLLHTHSH 
RHSP 


5084 


18985 


A 


5121 


3 


219 


ELRLCHCTPTWATRVRLSQKKKREKKKK 
NTFVGKKKGKKGKKP PNPKKKGSGLGFKG 
FLFS KFFKTRGTT 1 1 


5085 


18986 


A 


5122 


262 


351 


TRPGTLAHACNPS TLGGRGGW I MRSGVQ 
DQ 


5086 


18987 


A 


5123 


340 


0 


KKKKI S QAW WWVP VVP ATWE AE VG 


5087 


18988 


A 


5124 


94 


2 


LMP I I PALWE AE VGGAPGQE FQTS LANM 
VK 


5088 


18989 


A 


5125 


310 


68 


LSPFYKLGFWGRVIYPPPPPKVSPPVFG 
APPPPEKKKKPPPPPPKSS IKKPTEKKI 
CLAASGKGLEVCFCKKPLI QS YFL 


5089 


18990 


A 


5126 


343 


131 


SRPRRPGLELLTSDDLPAPATQSAGITG 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=PhenyIaIanine, 
G^GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L—Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, ' =Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VSNHTQ I WP ISVGDLMNTGWI PENRI V 
SELLREIHFLRICL 


5090 


18991 


A 


5127 


2 


145 


FSLLGSSVGRLEGTGTISTHCTLPASRD 
S PASNYRLTPP CPADFL YF 


5091 


18992 


A 


5128 


1 


130 


SRSVTRLECSGTISAHCNLPGSSDSPAS 
DYRLTPP CPADFL YF 


5092 


18993 


A 


5129 


1 


338 


WRKEGIRFEKEKSKDFKNHVIKYLETL 
LYSQQQVCKLWVTFI 


5093 


18994 


A 


5130 


346 


3 


DFFFFHPILFFFFFKKTLFFSPPEGNGG 
HTLFLKNTPPRGRI FFFLPPLLLWGGWA 
PPPPPKKNFFFFFFFFFFFFFFFFFFFF 
FFFFFFFMPPTQSLFKDQEGPVQGGEEA 
PV 


5094 


18995 


A 


5131 


3 


114 


FLHIGQAGLELLTSDDPPASASQSAGIT 
GMSHHAQP 


5095 


18996 


A 


5132 


292 


180 


AASTDSIEGHGASLPSKKTPSEEDFETI 
KLISNGAYG 


5096 


18997 


A 


5133 


338 


216 


RGENRL P P GGRGF SGPKSHFCP P AWATE 
RDSLSKKKKNPV 


5097 


18998 


A 


5134 


337 


3 


I WLGNGKGI IRNNQ I FSNKE AG I YIL Y 
HGNPWS GNH I F KGRAAG I AVNENGKGL 
I TENVI RENQWGG VD I RRGG I P VLRSTL 
I CFGYS DG VWGDEGKGLI EGNP I SAN 


5098 


\S999 


A 


5135 


227 


343 


NYVKEKLIPTWNWMVS IMDSTEAQLPYG 
SALTS WD PGQ 


5099 


19000 


A 


5136 


166 


375 


ATFVSLGVFGSAVILLYFKNMMKLDLSL 
TPHTTINSKWIKRLHTRPEAIKFLEVNI 
GKKFFDIGHEIIL 


5100 


19001 


A 


5137 


90 


2 


KTTLWPGAVAHACNPS ILGGRGGRITRS 
G 


5101 


19002 


A 


5138 


213 


379 


PTFQSVGETGSLLKMHILGPGAVAHACN 
PSTLGGRGGRITRSGVQDQPGQYGETP 


5102 


19003 


A 


5139 


384 


2 


FFFFFFLSVMESCSATQAGVQWCDLSS 


5103 


19004 


A 


5140 


41 


317 


TMSRDRPSDKTWTYNRSNWMPDDGAPF 
RYSFSALKDRHNAVEVNWIDPNNGWETA 
TELVEDTQAIARYGRNVTKMDAFGCTSR 
GQAHRAGL 


5104 


19005 


A 


5141 


29 


523 


VKAEAAYKKADD I WNLRKDDYFVNDEAR 
ARYWDDREKARLALEAARKKAEQQTQQD 
KNAQQQSDTEASRLKYTEEAQKAYERLQ 
TPLEKYTARQEELNKALKDGKILQADYN 
TLMAAAXKDYEATLKKPKQS SVKVSAGD 
RQEDSAHAALLTLQAELRTLEKHAG 


5105 


19006 


A 


5142 


432 


144 


GGFLPKAFLPPKKKKGAFFPPPGKGAIP 
PLF WGGW T VFLI P E KNP PRGFP P P KRGE 
KPVGGPFFRIFAPPPQKKIGGGGFFLER 
FFFWDWGKTGP 


5106 


19007 


A 


5143 


403 


6 


FFFFKKQIGPRGGGPPQKSPPLGGGGRG 
GSPRPGVLTPLGPQGKNLFFLKKKKKLG 
GGGGPRNPNFLGGGPGKSLYPGGQRFQG 
PK1LPFFPPRGKKKKIFFPKKKKKKSHR 
KL I QTLI LQTGKLMKMLWRT 


5107 


19008 


A 


5144 


115 


427 


KKKKKKKKKKKKKKKGGGPKKKKNFSPG 
GGKKKFFFNGAKKKKPRAPVKKTGEKKK 
GGKKKKKCFEKNPFFGGGKKKKKKCSSS 
YPCPKKQ 


5108 


19009 


A 


5145 


2 


79 


GCSEPRSCHCTPAWVTEGDSISKKK 


5109 


19010 


A 


5146 


317 


444 


IHQP VC VAHAYNP S TLGGRGGH I TRS GD 
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X=Un known, *=Stop codon, /=possib!e 
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nucleotide insertion 














RDHPGQHGETPYPL 


5110 


19011 


A 


5147 


3 


413 


S FYRGF I PVLQTVTVLI IGDPSLHGDAW 
S WME F FLTVI F I AL WMLPL I GLT I WNA 
I WFQD I ADLAFE VS WTKPHP FP S VSQ I I 
ADMLFPIiLLQALFL I QGIAVS LFP IHLV 
GQLDSLLHMTLLYSLYCSEYRLVIN 


5111 


19012 


A 


5148 


409 


247 


QTKFRHVAQAGPHFLGSSDVPTSASQSV 
G I TGM S LHAQHQHF FT ASMGRAFGEN 


5112 


19013 


A 


5149 


284 


1 


AEASRAHMEAHARTQGTPGS KAEGVGP V 
EE YR I WRRPGPDRAHASDS EE SGEPGGP 
PPHPANFVFLVETGFFRVGQAGLEFPTS 
GDPAASASPV 


5113 


19014 


A 


5150 


439 


120 


FGSPRLECSGVISAHCNLRLLGSSDSPA 
S ASHVAGI TARPNARI IKAFGAPSTCTV 
VWERHEQQTCIQRSDLPAGRQKKTGPKN 
RPGLVAFPFNASPLGGPGGRIA 


5114 


19015 


A 


5151 


415 


0 


FFFFFFFFFFFFLPFVQSFYIYIYMYI 


5115 


19016 


A 


5152 


67 


179 


ICLSLFLTVHVCVCVCVCVCVCVCVCEM 
VFGVSIPCN 


5116 


19017 


A 


5153 


280 


2 


KTPFFLNLKEKTRAPFFLKKTFFKKKPF 
KI FFS KKRNFFFFP P PGKRP PTL FKKKF 
FFFFQKKKNFFFS P KKKKKKKKKKGRRS 
RSRTSPRV 


5117 


19018 


A 


5154 


296 


406 


DRVLLCHSSCSEVAESRLTAASTSWAQA 
ILLPQPPEW 


5118 


19019 


A 


5155 


114 


402 


WRIKASVEKKKKKKKKKKQKKKKKKKR 


5119 


19020 


A 


5156 


110 


1 


QRFLL TLKS Ii I LRLG AVAHACNP S TLGG 
QGGRIMRS 


5120 


19021 


A 


5157 


382 


95 


SSSSPHFLTPSQLRLFFPFFPLKIFFFP 
KAFNFCGGVFPIFSPPKKKFFFQNSQVG 
FKNPPQKEKNFSFPPPVKFGPPKGFFKR 
PPPFFFFFFFFF 


5121 


19022 


A 


5158 


2 


350 


TLQPGRQSETPFQKKKKKKKKKKKKKKK 
KKKKKKRGGGFLKNLGGGP KI WGVEKFF 
F FFFGGGYKKPLGFFLE KI F FLGGGIWA 
PPPPKKISFWGKKKFFWGGGGKTPLFFC 
GGKK 


5122 


19023 


A 


5159 


1 


208 


PTRTL YLVKLGLMGPAEIFLS S FQEIRL 
I D YD KMVDHRGVR WAFGQ WAGVAGKYA 
RAYI THFLNWLGE 


5123 


19024 


A 


5160 


1 


208 


PTRPLYLLKLSLMGPVEIFLSSFQEIRL 
I DYE KMVDHRGVR WAFGQW AG VAGKYA 
RAYITHFLNWLGE 


5124 


19025 


A 


5161 


419 


25 


AQKKKKKKKKKKKKKGGGPLKKNFGGGK 
NKRGKKKKNFFLKRGGKKKTGGILEKKN 
FFGGGKKGEKPPQKKKALKEKKKFLRGK 
GGKKP LNWGGKKKW 


5125 


19026 


A 


5162 


475 


150 


KPRTSGTARVPGDPPPRCLDDKRLPSRP 
VI FVFLAETECDRVGQDGLQLLTSGDHP 
HLTFPNCYNYRREPPRPTSSGFQHDIET 
LH YRRGKADRFKAHIi P K 


5126 


19027 


A 


5163 


237 


409 


FHNQKT YLDS ALCTLVYFVVF VLRDGI F 
LLPQLECSGTVMAHCNLKLLGSRSPPAL 
A 


5127 


19028 


A 


5164 


373 


476 


LGFILLSFLETRSHSVAQAEVQWCDHGL 
LQPQTP 


5128 


19029 


A 


5165 


374 


132 


I F YSPFLYYHMLGKYHKI KNLKI KHMWL 
DAVAHACNP S TLGGRGGRITRLGDRDHP 
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K=Lysine, l^=Leucine, M=Methionine, 
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GQHDETPSLLFLWTFASSLSGFPL 


5129 


19030 


A 


5166 


332 


2 


RPQGLANCFPGALPPTKSEGFPGQSPGF 
QIWGGKAFFLRGFPFFPQVESKGAVSPP 
CKVPPPGLRPFSAPTPPKKTGDKRGPPPG 
KAIW r r c h vr JUV hi iorr HHVoJnUCjIjj 


5130 


19031 


A 


5167 


238 


1 


GGFTGTGGKYEPAGIVHRGEFVFTKEAT 
SRI GVGNL YRLMRG YATGG Y VGTPGS MA 

t> c~* n c* *a r~i i~y rn th t? /*^~rtvtt ta~i <n 7\ tt» o t~> n 

U&R b QAS (J 1 b hi QNNHijh AtxES RL* 


5131 


19032 


A 


5168 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5132 


19033 


A 


5169 


3 


127 


DED FS D FDEKADDEDFVP S DAS PPKTKT 
SPKQVSYLIWVLS 


5133 


19034 


A 


5170 


20 


254 


IAPPARIRHEERERERERERERERERER 
ERERERERDTRAEWRARLFFFFFFFLPA 
DQCFFIKPSGQRFLPPGGGGVP 


5134 


19035 


A 


5171 


36 


479 


TXGXDNKKDLL I S VGDLVDRGAENAECL 
ELITFPWFRAVRGNHEQMMIDGLSERGN 
VNHWLLNGGGWVFNLDYDKE I LAKALAH 
TADELPLI IELVSGDKKYVI CHADYPFD 
E YE F GKP VDHHQ V I WNRE R I SNS QDG I V 
KEIKGADT 


5135 


19036 


A 


5172 


278 


448 


RFLCPLLHPFFSSETKSLTLLPRLGCSG 
VI IAHCSLKLLGSSNPPTSASRVAGMTG 
S 


5136 


19037 


A 


5173 


205 


1 


HARL I FVFLVETGFHCVGQAGLELLTSG 
DPPSASQSETPSPLKILKSAGCGGANST 
ALASGLLFSLGP 


5137 


19038 


A 


5174 


409 


194 


FFFLR WS LALS ARL VQWRDLGS RQAP P P 
GFTPFSCLSLLS I S FQPS ATYMRFLNT I 
TIVEYSFAYFPISQP 


5138 


19039 


A 


5176 


339 


472 


AKFT C I S S N I KL SNT RPGT VAHACG P S T 
LGGRGGQ I TGSGDGDH 


5139 


19040 


A 


"5177 


426 


148 


GGRGPPPLSHPFGGGGGGGPLRAGGQKN 
PGPKGETPFFGQTQKKNRGGATPPFSQK 
LGGEKHKNSFTPGKENSIRFDFPPAPPT 
WGEKKNFFF 


5140 


19041 


A 


5178 


328 


474 


NNTEYNKNRLGAVAHACNPSTLGGRGRW 
ITRSRDRISP 


5141 


19042 


A 


5179 


448 


330 


E TG FCHVGQAGL E L LGS SDL PAS AS QS A 
GIIGVSPHAWQ 


5142 


19043 


A 


5180 


259 


14 


LRYKAILCSWIRTTNIVKMAELPKVIYI 
FNAI P I FKIPDMFCRIARF I LKCMWNVK 
VS T I DKT I FKKENR VG I P P RVLMW 


5143 


19044 


A 


5181 


2 


423 


KFYATLVRYVGDRKNLVCREMSMALLSN 
LAQGDALAARAI AVQKGS IGNL I S FLED 
GVTMAQYQQSQHNLMHMQPPPLEPPSVD 
MMCRAAKALLAMARVDENRSEFLLHEGR 
LLDISISAVLNSLVASVICDVLFQIGQL 


5144 


19045 


A 


5182 


182 


312 


F F I Q VGFHHVAQAGL KLPS S S S P PHLAS 
QSAGI TGVSHCAQPG 


C i Azr 
D 1*1 J 








J\) 1 




nT13T7T7TrTrVT.PPC]T?AT,\/2\n'nri\7'nWP DT,nQ 
uj. ire L' r V r JUJXKo £?x^-U v±\\^U\j V y WK.L-'XjVjjo 

PQPPPPGLKQFFRNSTAWTKGLQPFPRL 


5146 


19047 


A 


5184 


216 


413 


KPXXKKKKKKKKKKKKKKKKKKXGGAVK 
KKGPQKTPVKPPGGFFFAPPLGVSPPPG 
VFFAGGGAPP 


5147 


19048 


A 


5185 


357 


465 


IKPTKILAPGPGAVAHACNPSTLGGQGG 
WITRSGD 


5148 


19049 


A 


5186 


467 


0 


FFFFFFFFFRFLFFFFFSEKK 
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NO: of 
nucleotide 
sequence 


SEQ ID 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine 7 
X— Unknown, *=Stop cod on, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5149 


19050 


A 


5187 


346 


3 


RFLPLGQGGVEILTPRSTCLGPPKWWDY 
KGEPLRPAQKCFFFKKKRRRKKVQNSVA 
VYMVYHSMCGENKLRWRWWGI FTHTHTH 
THTHTQ KKQN I L»S RTH KKLV I T TGS E E G 
TL 


5150 


19051 


A 


5188 


11 


109 


GETP S LLKVQNIXWAWWHMP VI PAMWEA 
EARES 


5151 


19052 


A 


5189 


3 


151 


QLLRRPRQENHLNPGSRGCSE PRLHHCT 
PAWATVQDS I SNTNNWNNKCP 


5152 


19053 


A 


5190 


2 


373 


AVAAD KGVP L YRH I ANLAGN PD L I L P VP 
AFS V I NGGS HAGNKLAMQ E FM I LP VGAS 
SFKEAMRIGAEVYHHLRGVIKAKYGKDA 
TNVGDEGGFGPI ILDWNDALELVKTAIQ 
AAGYPDKGAIGR 


5153 


19054 


A 


5191 


55 


421 


ARVAD VCE S MKEHLLVLVE RAKY I P G F C 
ELPLDDQGALVTAHAGEHLLLGATKRSM 
VFKDVIiLLGNDYIVPSHCPELAEMSRVS 
IRILDELVLPFQEbHIDDNEYAYLKAII 
FFDPDKETEA 


5154 


19055 


A 


5192 


163 


466 


TCFLGSQSAGITSVSHCPSREVFFLKLI 
HWRQGGQVALLVATPHS PCCPQYRLAP I 
PRARHD FACAS L I FAC I LLVHVLLM P RV 
S AGRG VGVR PAG I QAGR 


5155 


19056 


A 


5193 


470 


0 


ECLWPGTVAHASNPTTLGGRGGRIMK 


5156 


19057 


A 


5194 


491 


281 


RGLALLPMGQCSGMMLAHGSLDLLDSSD 
PPASASQSVGIMGVSHHAWPSLVNISFV 
CLIHRSPKTKPEGR 


5157 


19058 


A 


5195 


3 


222 


LS FHSLHRCLYMLGTTS ENVS PFSLLEL 
LSRLATLLGDYCGSLSEGTI SRNVALVY 
ELLDE VLtRLE S RC VAQA 


5158 


19059 


A 


5196 


20 


191 


STWWNSSRGGGCSEPRSHHCTPAWVTER 
DSVSKTKKKINNKPALALPAVHPTSHPG 
S 


5159 


19060 


A 


5197 


460 


350 


RRLRRENHLNPGGGGCSELRSSHCTPAW 
VTERGPSD 


5160 


19061 


A 


5198 


174 


41 


SLGELLTECLFETKSHSVTQAGVQWRHP 
GSLQPPPQGFKRI PPH 


5161 


19062 


A 


5199 


131 


454 


FEYFKNRKPFFFFFFFGKGVSFCAPGGK 
AGDPFKLREPPPPRVKGFFGPPPPSGGN 
NGPPPPPRVFFWFFKKRGGSPFWPGGVL 
TPGGGPPPPAPQRGGINGLDPPAR 


5162 • 


19063 


A 


5200 


168 


51 


TVKTP IRPGAVAtiACNLSTLGGRGMW I T 
RSGARAQPNR 


5163 


19064 


A 


5201 


414 


497 


LLFXXXXDNGS IYNPEVLDITEETLHSR 


5164 


19065 


A 


5202 


381 


614 


VQPGAPPEWVALSTCPSAAPEGSQQPYI 
PPTFCFFPVKWLLSVTWLYLFIYFLSWI 
SVAQAGVQWCNHGS LQPRPS RL 


5165 


19066 


A 


5203 


377 


485 


PKEPGPQPCAPQPQPHAPQPQPRAPQPQ 
PCAPQPQP 


5166 


19067 


A 


5204 


54 


434 


MRTPENLELTOPQEFGSSWAAVECPDTL 
U PKJJLYLL. V J_»N F IjK. h, Jr r AKrvbj C b 1 J_i1j b li V Jb 
EICHPWDVTWFYSNCLTDTCGCSQGGD 
CEC F CAS VS AYAHQ C CQHGVAVDWRT PR 
LCRECPRQSPEGTE 


5167 


19068 


A 


5205 


201 


418 


GDAGAHAVLRLLPGHPGTSHLPFSSLTT 
S PLGLKRVTGGFNS KNRCDART YCYLLP 
TFAFAHKDRDVQDETY 


5168 


19069 


A 


5206 


27 


177 


NSFHHAGQSGLELLTSSDPPASASQSAG 
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M 
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nucleotide 

location 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, BNHistidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unkno>vn, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ITGVSHRAQPKAGTFLSPFHR 


5169 


19070 


A 


5207 


6 


1175 


DLGPHTPAWTRPKREDLVYQSTVRLPEV 
RISDNGPYEGHVGIYDRATREKWLiASG 
NIFLNVMAPPTSIEWA&DTPAPFSRYQ 
AQNFTLVC I VSGGKPAPMVYFKRDGEP I 
DAVPLSEPPAASSGPLQDSRPFRSLLHR 
DLDDTKMQKSLSLLDAENRGGRPYTERP 
SRGLTPDPNILLQPTTENIPETWSREF 
PRWVHSAEPTYFLRHSRTPSSDGTVEVR 
ALLTWTLNPQIDNEALFSCEVKHPALSM 
PMQAE VTL VAPKGP KI VMT P S RARVGDT 
VR ILVHGFQNE VFPEPMFTWTRVGSRLL 
DGSAEFDGKELVLERVPAELNGSMYRCT 
AQNPLGSTDTHTRL I VFENPNI PRGTED 
SNGSIGPTGARLTLVLALTVILELT 


5170 


19071 


A 


5208 . 


401 


3 


FFFFFSETESHSVAQAGVQWHHLGSLQG 
PPPGF 


5171 


19072 


A 


5209 


402 


229 


GFFKKGPPFFFFFFFFFFFFFFFKMVWL 
RLSGSKVEWWGSQKLYMNFGCCRGRSW 
FT 


5172 


19073 


A 


5210 


198 


2 


PPQAKILSSSSPPIRPPPKKGFFPKNPQ 
VGFYSAPHKEKTFTLPAPVKFGPPKDPL 
KRPPP 


5173 


19074 


A 


5211 


516 


222 


GS TLRLTQRSMPAS S S TMAP S S KRLLLP 
RP E R S VP AAAGTAGHHE ASRNCGRGGAG 
ADEGPATKGDSHPKPGYCRAAHPSAAPW 
PPGPEKNFMRVG 


5174 


19075 


A 


5212 


103 


203 


DKVSLRCPGWPQTPGLKQSSCLSLPKCW 
DYKCE 


5175 


19076 


A 


5213 


409 


249 


RRNQVGGPRQLNPGKRIKSPGIDPYIYG 
QL I FDKAVRAI WWRKGS S FLQKALE 


5176 


19077 


A 


5214 


2' 


170 


ERKILGY I QLRKGPNWGPYGLLQP FAD 
AIKLFTKEPLKPATSPYPLQSPAFPLKP 


5177 


19078 


A 


5215 


270 


464 


GQLNKNVWGWDPGKVTFRDGVSLYCLHW 
NAVAIHGHRHSVPQPQTPGFKQSCLRLP 
SSWDYMPLY 


5178 


19079 


A 


5216 


156 


248 


PDKSYI ICVCVCVCVCVCVCLCVWVCME 
CP 


5179 


19080 


A 


5217 


412 


2 


PPPQIFFFSPPHFFSPPPKGGFFPPPPP 
KIFFFPPPLFFFWGFFPPFSPPPKKFFF 
PKSPPFFFFPPPKKKKFFFFPPLFFFPP 
PGFFLTPPPPFFFFFFFFFFFFFFFFFF 
FRMGCDRWHGEFWI LRDGFD 


5180 


19081 


A 


5218 


165 


1 


Q VKYHNKKTHN P I K I GKRS EQT FHE KDT 
QMANKYMKRSPTSLRIKGIQIKTIMRY 


5181 


19082 


A 


5219 


408 


88 


PSSSPHFFSPPPLGGVFPPFPFKIFFFP 
PGLFFFWGVFPLFSPPPKRVFFPNSPPG 
FFFPPLFGKKFFFSPPPFFWPPPGFFLS 
PPPLFFFFFFFFFFFFFFFF 


5182 


19083 


A 


5220 


3 


259 


NliLLLIVPILIAMAFLMLTERKILGYIQ 
LKKKKKKKKKKKKKKKKKKKRGGAFKKK 
PRGGQNF PGE KKKKF F FLRE VKKKP RGN 
F 


5183 


19084 


A 


5221 


10 


479 


KLMPAESDGRHRERERERERERERERER 
ESSSARARCVSLSLDRYKNRVYMKVAEA 
LCECRLLAYISQAPTQMFFLLRLINIIH 
AHTLTQENDMCLHTTLEILTLARQRERV 
PLYLRVLQRMEHKKKHPGFVLNNSHNLV 
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USSN 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F— Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T-Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RLWQQLYLHRDRDS I C 


5184 


19085 


A 


5222 


142 


56 


KFQPGAVAHTCNPSALGGWGGQI KRSGV 


5185 


19086 


A 


5223 


14 


471 


LEMRSEIDSQNYCNCDAGRNEWTSDTIV 
LSQKEHLPVTQIVMTDAGRPHSEAAYTL 
GPLLCRGDQSFWNSASFNTETSYLHFPA 
FHGELTADVCFFFKTTVSSGVFMENLGI 
TDF I R I ELRGKLATLDKSQGTHYLAI KA 
LTQKKKKKKTRG 


5186 


19087 


A 


5224 


480 


0 


FFFFFFFFFFFLIVAQFYFL 


5187 


19088 


A 


5225 


16 


422 


VRRTAR I RHEGKPYE CNACGKAFNRS AH 
LTEHQRTHTGEKPYVCKECGKTFSRSTH 
LTEHLKIHSCVKPYQCNECQKLFCYRTS 
L I RHQRTHTGE KP YQ CNE CGKS FSLS S A 
LTKHKRIHTRERPYQCKKKKKKK 


5188 


19089 


A 


5226 


99 


436 


GHP S FE I P ATMTAAP AGF P PQ VP WED VR 
YLFDE IMYGGHI TDDGDCKLCRVYLEE F 
MNPSLLAALSGTHACGEKPRLPANSHVS 
EPSWKWILQPQSSLQMTAAIxANILTATS 


5189 


19090 


A 


5227 


473 


9 


FQ ITATPHLAVYD PT VQFE FWFS E KQ I A 
D I RQ VE AS TR YLGTAL YW I AAS INIKPG 
HD YYFYI RS VNTVGKSAF VEAVGRASDD 
AEGYLDFFKGKIAESHLGKELLEKVELT 
EDIASRLEEFS KEWKDASDKWNAMWAVK 
IEQTKVGERDVTSG 


5190 


19091 


A 


5228 


1 


473 


PPPIDRLPNCTACRNSARVMTDAGQPHS 
EADYTLGPLLCCGDKSFWNSASFNTETS 
YLHFPAFHGELTADVCFFFKTTVSSGVF 
MENLG I TD FIR I ELRGRLATLD KS QG TH 
YLiAIKAIjTQKKKKKTRGGAGPPFPLIGS 
RI T I HGP PFNWAAMREK 


5191 


19092 


A 


5229 


170 


25 


KT PLLG P GT VAHACNL S TLGD Q S G C I MR 
SGVRDQLRQHDETPSLLKRI 


5192 


19093 


A 


5230 


465 


0 


WLYPPQKAQKKKKKKKKKKKKKKKKTK 
ARG 


5193 


19094 


A 


5231 


419 


56 


C VLLRS TKKKKKKKKKKKKKKK 


5194 


19095 


A 


5232 


79 


451 


AGSNLQEHRGLRPESPFNPGGGGCREPR 
RRHCPPAWAKRAKLRLKKKKHGRNQEVA 
QGRHGPFFPGLGPEGFLPRRPGSRKRRK 
GRNVPGPELKGPGAVTiFSRHRRGGADLR 
PPRKGPRGGGGG 


5195 


19096 


A 


5233 


39 


467 


VQQQQRAQEQQQQHPVLHLQPQQIMQLQ 
QQQQRQISQQPYPQQPPHPFSQQQQQQQ 
QAHPHQFSQQQLQFPQQQLHPPQQLHRP 
QQQLQPFQQQHALQQQFHQLQQHQLQQQ 
QLtAQLQQQHSLLQQQQQQQIQQQQLQRM 
HQQ 


5196 


19097 


A 


5234 


194 


45 


MILKFFFNFKIHFFFFLRQSLALVAQAG 
VKWCNVGSQQPPPPGFKRNST 


5197 


19098 


A 


5235 


204 


475 


AS ITVHLWYVEKCYSQSTEKVTPMPSSS 
W FGP Q PD AQLKHGLR P G AVAHACN P S T M 
GGRGGRI TS S GDRAHP 


5198 


19099 


A 


5236 


59 


357 


imWRKIYCKVYNRKKIKHNTTNTTKNYQ 
NFQRNRTKS S FT I FRI QSHYNKH I KKH I 
QRKRQSIETDLEMTQMLELVWIFKWIV 
ILTKIKTPKTLKING 


5199 


19100 


A 


5237 


447 


166 


ELLAFWQTQYi^n^MNAVNLFFFWIKAPGI 
PTPSFASRRTHSPFFSFLFFLFFFLGKK 
GFPLCGPGGPKSLNLKTPPLGPPKGGK 
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peptide 
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Predict- 
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nucle- 
otide 
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last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=L.ysine, L-Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, Y=possibIe 
nucleotide insertion 


5200 


19101 


A 


5238 


34 


296 


L.S VAE FIAFWQTQ YNNMNAVNLFFAW I K 
VPGI PTPS FASRRTHS P FFS FL FFLFFF 
LGKNGFSPCGPEGPKFLNLITPPLGPPK 
GWE 


5201 


19102 


A 


5239 


340 


472 


SLNETHVRSPAHHRPGATAHACNPITLG 
GRDGL I MRS GDQDHPG 


5202 


19103 


A 


5240 


460 


0 


PKKKI FTI YFFYFFFFFFFFFFFFFL 


5203 


19104 


A 


5241 


161 


46 


SFYSTHRKPCHWPGSVAHACNPSTLGGR 
GGRITRSRDR 


5204 


19105 


A 


5242 


175 


36 


LKYYLIFIYLLSFFFLRQSRSVGQAGVQ 
WRDLSSLQPPNSTALQYR 


5205 


19106 


A 


5243 


11 


442 


DT I RWGLPTLGS KS TTNEKKRE KRRKKK 
EQQQS EANELRNLAFKKI PQKS SHAVCN 

aqhdlplsnpvqkdsreenwqewrqr.de 

QLTSEMFEADLEKAIjLLSKLEYEEHKKE 
YEDAENTSTQSKVMNKKDKRKNHQGKDR 
PLTV 


5206 


19107 


A 


5244 


459 


333 


flrvtqaglkllsssdpptlaspkcwdy 

RHEPLCPAQWSVS 


5207 


19108 


A 


5245 


229 


436 


FNSTWRKRGLGGFIHLNLIQTSFFAHL 
WSLTYVLGVSFFVCLFLRQSHFVTQAG 
VQWHNLGSLQPLP 


5208 


19109 


A 


5246 


227 


30 


VFFARWVFLRQDLDLiSSRLECSDVILVH 
CNVRLAGSRGTPSSVSKVAGTTGMLYHT 
WVFFYEFHR 


5209 


19110 


A 


5247 


173 


442 


FLGSSSKAAITLYYCQYMEFFFFFFFFE 
KNFFFSPPGGGEGPNFILLEFPPPGLKK 
ISCPTPPRGWDYGVAPPPLVNFWFFKKN 
GVPPPW 


5210 


19111 


A 


5248 


255 


358 


TG P GT VAHTCDP S T S GGRGGQ I TMS GDR 
DHPGQH 


5211 


19112 


A 


5249 


3 


130 


QPQLAAPS ISWAPTSASQVAGTTGVCHH 
AWIilFLFWVESRRG 


5212 


19113 


A 


5250 


90 


1 


STRLEDJNGTMSAQCNLRLTGSSNSPNSA 
SR 


5213 


19114 


A 


5251 


358 


447 


SFISYRRRSPSPYYSRYRSRSRSRSYTP 


5214 


19115 


A 


5252 


55 


202 


CWNTSMRGPIWEAEAGESLEPGGRRSCG 
EARTCHCI PAWWRAKLVDAA 


5215 


19116 


A 


5253 


330 


3 


EEMGFSHVGQGGFKILASGDTPAWAFQK 
GGISGVSPRAWAGFIIFIWNFSGGPQTG 
FPFYPFRGIFFPRAKINYIRVLNPKGLC 
QKGVFFFFFSGFPHYRCRLCQISV 


5216 


19117 


A 


5255 


175 


3 


M YLRF P LGLSLS ERPGMVAHACNP CT Y W 
E S KAGG S L S LGVQD Q PREHGGTL C LQ KS 
K 


5217 


19118 


A 


5256 


263 


1 


RPRRRFNRDKTALLQEMPFRTFTALGKL 
ISDFKDSKDGLTLYFVETASRSWLAGL 
ELLTSSDPPTSASQSTGITGVSHFSQPG 
LAF 


5218 


19119 


A 


5257 


1 


103 


CHVS QAGLE LLAS S D P PAS AS Q S VGI TG 
VSHRTW 


5219 


19120 


A 


5258 


128 


2 


I HKWRD C KMVQ P L WKRAWQ FLMMLMTHL 
SGDPTVLLLDILTQ 


5220 


19121 


A 


5259 


381 


2 


FFFFFSEMESPS VAQAGVQW CGLGS LQA 
PSPGFT 


5221 


19122 


A 


5260 


2 


173 


ENYLNMGGGGCS EPS LHETCS PALAIERD 
SVSTTTTTTKQQQQNNQLGLFSYPFNPL 
I 
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peptide 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=0Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y—Tyrosine, 
X^Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


5222 


39123 


A 


5261 


3 


175 


I KGNQ I GKKE VKLFLFTDDMI L YMENPK 
DSAKNYYKQSKTFCLPVVLGRSCKRTCP 
LA 


5223 


19124 


A 


5262 


2 


145 


QVS LKLLGSSD P PALTSQGAG I MGVSHR 
TRPRNTDFKNNTQKSKNTQ 


5224 


19125 


A 


5263 


182 


3 


STTS FLNFFLTEMVSHYVSQAGLKVLGS 
SDPPSSASHSAGITGMSHHTRPQQPLLN 
LjKV 


5225 


19126 


A 


5264 


183 


369 


WNSLDFFFFFPSFFRGYSSMGKVQDAFI 
FYRRF I DKS KPRANT WGS I GGLYQQQ I H 
PMDALR 


5226 


19127 


A 


5265 


329 


207 


RGCSELRSRHCTPVWVTEQDSVSQKRKK 
KSPCHLQLGTSQ 


5227 


39128 


A 


5266 


177 


13 


TLT I LLNFFFLRQS YS VAQAGVQWHNLN 
SI J KPPPPGFKRFSCI J SLPEFQQGHNIR 


5228 


19129 


A 


5267 


474 


122 


FICSPHPGGFLTLGPKLNILGPSQPLFP 
LRGGS KKQGI PFYS YKIKGP PGAGPPGS 
ARYSNPFGGYGGRVPLiPPEFPDPPGEHK 
PRFFLKKKKKGHQVPTSKYLPEYLFAYY 
MASLE 


5229 


19130 


A 


5268 


55 


311 


GIQTFGKNVWAGRSKNVGMP IAMLLHT 
DGEHERPGGDATVTIAHRYTPKEQLKIH 
TQLAD III VAAGS LSI SHAG VQ WRNHG S 
L 


5230 


19131 


A 


5269 


238 


133 


NKN I WVQKKC VF Y I YLPL I C VC VCVC VC 

vcvcvcv 


5231 


19132 


A 


5270 


336 


517 


PVLYILEHSPYNAS I YQLALKKYQSRPG 
AVAHACDPNTLGGRGGRITRSGDRDHPG 
QRGE 


5232 


19133 


A 


5271 


262 


467 


VLSMRPRIHGSAAREEDEHPYELLLTAE 
T KKWLVDGKT KGT F P TTP GRTNS KG I F 
KVCDPEWKGKMS 


5233 


19134 


A 


5272 


213 


2 


MKI YINVYI LLFSLKKRQGPTLS PRLE C 
SGMI IVHRSIiKLVGSSDPPALTSRIAWL 
IGARHTPDCSVCKF 


5234 


19135 


A 


5273 


284 


105 


YSCSYSYFDEPVELRNSSFFRWNHSSDS 
YWKKKS SKDTE P VLKP PGYSAR YE CKT V 
GSS 


5235 


19136 


A 


5274 


290 


449 


I F YF S GRVRAKL S AP LAGMGNAKAD SRG 
RSRTKMVSQSQRMYCLSALLYLSLC 


5236 


19137 


A 


5275 


248 


423 


NKVNITLMTKLEKDTTKKENYRP ISLLN 
IDMKILSKILQNQIQQYIQSIIHHDQVK 
LL 


5237 


19138 


A 


5276 


425 


30 


TRRGMPHRQGSPRKAPVSVRRQRVREEC 
GREPLLWFLQSQLLGRLRQETHLNWGGR 
GCSEPRS CHC I PAWATRANS I FCNFQAS 
S VE VRR S ARKKL FS D I L KRHNT I TWR VS 
GLIiL VD S Y FGRLAT P VRTQ 


5238 


19139 


A 


5277 


25 . 


185 


SLFSVAEFAFNAYKVQTEKEEKEEARSK 
YKEAKESFQRFLENHEKMTSTTRYK 


5239 


19140 


A 


5278 


263 


483 


VFTWF FQIiFCVYKL I FS FLTE CRS VAQ 
AAHAGAHWHNLGS LQP PLSGFKQFS CLS 
LPEFPRM YSVLS PS P 


5240 


19141 ! 


A 


5279 


25 


252 


LETLGLMTLVQCGGI PGFCHVGQAGLKL 
LTSGDLPASASQGAGIAGMSHHTRPILL 
HFIYFILLYHPPCLAYFIIF 


5241 


19142 


A 


5280 


326 


203 


IETESCYVAQDGLELLGSSNPPTSASQS 
AGITGVSHHTWTT 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C~Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G-Glycine, H=Histidine, I=Isoleucine, 
K=L»ysine, LHLeucine, M=Methionine, 
N=Asparagine, P-Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5242 


19143 


A 


5281 


210 


459 


NKJNJNKKFNEKKKKNYKKWARELYRHFS K 
DDIHVANRYMKRYPTSLIIRKIQIKTSM 
RYHLTPIRIATIPR 


5243 


19144 


A 


5282 


262 


454 


LLFTDYHEGMLHSVKLLFCFLGWSFTLA 
IQAGGHWRDLGSLCPIjLPRLKQSSCLNL 
LSSWDYKR 


5244 


19145 


A 


5283 


451 


303 


GTEFHHVGAGLELLGSSDLPVLASKSAG 
I IGMGHHAQPKQYS FVILTVQ 


5245 


19146 


A 


5284 


217 


489 


PVSISIILRCLSLPTPGSRVLCMIjEGMD 
GISLGWVPLPSLPVPLHPLDLSLPVCRQ 
VGGTKTGVVRYVGETDFAKGEWCGVELD 
KPLGKND 


5246 


19147 


A 


5285 


247 


99 


FFFFFFFFFLYGFFFCFFFFFFFFFFFF 
FF FLKNKTQP YF I YKRN I LSM 


5247 


19148 


A 


5286 


2 


117 


PR VRPR VRPR VR P R VRPR VR PRVRKKKK 
KKKKKKKKKKGGGF 


5248 


19149 


A 


5287 


121 


3 


KFWPGAVAHACNPS PLGGRGGW I TRSGD 
RDHPGQHSETP 


5249 


19150 


A 


5288 


416 


48 


IGKPGSFFPPPPSPRGAPPPPPKIFFLP 
PPPFFFRAGFPLFPPPPQIFFFPKIPPP 
FFFTPPLKKKFFFPPPPIFESPLGLFFF 
PPPPSLFSFFFFFFFFFFFFFFFFFFFF 
FFFFFFLGVR 


5250 


19151 


A 


5289 


503 


0 


FFFFFFFFSPFFFFFSSSFFFFFFFFFF 


5251 


19152 


A 


5290 


70 


236 


I F CNS QLTS PHKHQKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKGGGDFKKKIGGAK 


5252 


19153 


A 


5291 


443 


287 


QFTKKKKKKKKRKKKKNKKKKTKQKKT I 
IPKPPLF 


5253 


19154 


A 


5292 


334 


115 


KNPNAIiFFFFFFVDRISLLPRQWHNLGS 
LQPLPPGFTQFSCLSLPNSWDYRCEPLR 
PALLFFSSKSKIHIKPL 


5254 


19155 


A 


5293 


413 


3 


SSSPPIFFPPQKKKIFSPPPPKKFFFPP 
KTFFFLRGFFQIFPPPKKNFFPKKPQNF 
FFYPPKKKKIFFFPPPKIFPPPKIFLKT 
PPPFFFFFFFFFFFFFFFFFFFFFFLVS 
LEKGF PDAWADAWADAW 


5255 


19156 


A 


5294 


291 


11 


IQPYIKRLMQHDRVGLSSEKQSWFRIYI 
YXNVPGFQNKGFI I IGSSQKAQKMIFFF 
LDRVLL CHP GRS TWQSQ P TAAL KNWAQ 
KTLPPQPSE 


5256 


19157 


A 


5295 


2 


401 


NPRSTE AAI KYFLTQATAS I I LL I AI L F 
NNILSGQKKKKKKKKKKKKKKKKGGGPK 
KKKIGGGKKKPGGKKKIFFFIRGEKKNP 
RGE FE KKTFFWGGE KRANP P P KKKTLGE 
KKKILGGGGGKKFFFFVGKKK 


5257 


19158 


A 


5296 


49 


363 


RhQKKNKEKKKKKKKKKKKKKKKKNKRK 
KKKKKKKRRNGGGGKLLWKGGGQ ILTKE 
KILILVAVSYTCNINNLGRQRKWIIGGG 
KSEPNYEFKRNIGAGRGGSLL 


5258 


19159 


A 


5297 


100 


323 


E YLNTCVRI CHSHAHFLPPS YVS FALLP 

T) TpT T?XT\T\TT T \7~Mr\X TTpr^TtT^T^TZ' 7V X TXTV O T Tri/~i T 

Kr J-ir H V V UU Y lYJy V r ij-aiyl V KixoLiiiCj-J. 
FDD I VPDGKVK INVCFL F 


5259 


19160 


A 


5298 


595 


0 


VCVCVCMCACICIC 


5260 


19161 


A 


5299 


412 


141 


TVDTSYSEKKILFQILLLIDNGPGHLRA 
LMEMYKE I NWS M PANT TS I MQLMNQGV 
ISIFKSSYLRNAFCKAIAAIHSDSFDES 
GQSNLL 


5261 


19162 


A 


5300 


216 


2 


SQDWLRKFCYHPRVFQRQPIQKLMGLFF 
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peptide 
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Predict- 
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nucle- 
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ding to 
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acid 
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peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=PhenylaIanine, 
G=GIycine, HNHistidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possibJe 
nucleotide insertion 














FCLPNMVKNGVFFFFEMEGRSWQAEVQ 
WHDNS PLQPKP PGLK 


5262 


19163 


A 


5301 


368 


3 


IFFFPQGKEGGGLFYIFFPPKKKGFFFK 
KKIFFFYPPPKKKKKIIRAAEKRGPPLF 
FLKKTRPNFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFLVREK 
VFNSTISTQ 


5263 


19164 


A 


5302 


389 


2 


AMAQASGSCLCSRLLVDGFIPWGVFRYF 
I VEAGS CYVDQGGIiAILASS YPTTLVS Q 
SAGMTGLSHRARPPLNGPPPGLTPSLSM 
VPKAOTFSLLEKPTGLTQALRITIANVS 
NAPGQTRSGLEGWGCFF 


5264 


19165 


A 


5303 


84 


333 


FHD CCRP ERSR I LRNFLLS SAG I QS IAD 
AYRLALPQGLLYGPTNFAP I INHVARFA 
AHAAHQGTAS VRWGGS GSGGNQGLP 


5265 


19166 


A 


5304 


485 


93 


PPLFRRFSPPPPPKFFFSPRAFFFGGGF 
FSFFSPPKKSFFFKNPPGFFFFPPLKKK 
KIFFLPPLFLAPPKDSFIiAPPPFFFFFF 
FFFFFFFFFFFFFFFFFFFFSRENIFFI 
III F I YLFRDRFLFCCLG 


5266 


19167 


A 


5305 


157 


2 


PPPPFFFFFFFFFWWVITLNVELEPSF 
SPNTESQIGPEEAMERLQENRVE 


5267 


19168 


A 


5306 


I 


133 


TQES IMDDENDKQQLDERLKKKKKRPAV 
LKNP WGGQS LPGMARE 


5268 


19169 


A 


5307 


239 


2 


PPRNWGFFSPLSPLKSSSPPKAFNFWGG 
VGPNFPPPKKRFFSKNPPGGFFYPPLKE 
KNIFSLPPVNLGPPRVFFKGPPP 


5269 


19170 


A 


5308 


454 


116 


FRLP S S SMS PAPASRVAG IS GFRCVGRA 
GLQLLTASDPPASASQGAGIADGVSFTQ 
CSMVPRLECSGMI SACYNLHLPAACLGP 
PKCRDCSLCLAATPSGK 


5270 


19171 


A 


5309 


123 


1 


CSVSKWYPIAFLKNI IMLWEAEAGGSRG 
Q E I KT I IiANT VKP 


5271 


19172 


A 


5310 


153 


33 


WPGWAHTCNPPPIiGGRSGLITRSGVQP 
QPGQHSESPSI 


5272 


19173 


A 


5311 


350 


3 


NSRNLIIVTQEHSPEEKNEFLFSLPLQSV 
S MNTTHS P IjL SNS L PHF 1 1 S F L P TRF LV 
GPSPTRSFPPAQSPDRISFSSRLECSGT 
ITAHRSLNLPGSSDPLISVSQVAGTTCM 
CHH 


5273 


19174 


A 


5312 


1 


392 


RTRGRTRGI CKSITI IHHINRTNEKKNH 
MIISVDAESAPLQIHSAKLKK 


5274 


19175 


A 


5313 


1 


239 


L KT S FHHVGQAVLE F LAS SDL PALAS Q S 
AGITCMSHCAQPY I S INF I PAAETQQS S 
PELVPLPASQKTPSFSTPLFSLP 


5275 


19176 


A 


5314 


345 


237 


liYFYIYFLKSACVI ILSTLCVCVCVCVC 
VCVCLCM 


5276 


19177 


A 


5315 


268 


412 


GMISSNNSSDSFGNCLNPGGKGCCEPGS 
YHYP PAWATDTFS KKKKKKI* 


5277 


19178 


A 


5316 


112 j 


224 


PWMLENELGLHAS YLAMSTPLS P VE I E C 
ASKKIFTFC 


5278 


19179 


A 


5317 


377 


139 


KPPPRIFYLGPQKKKKIiFYPPPFKNCFF 
LGPPPPPIiFMIFFFFFFFFFFFFFFFFF 
P Q WVMFL F FLLS ASMQNS P S WS P 


5279 


19180 


A 


5318 


216 


385 


RGVLLCCPAWSQTPGLKGSSCLSLPKCW 
D YKHELPRPATS CNS ILFFNLQGAFMYH 


5280 


19181 


A 


5319 


376 


3 


RIDFGGPKKKILLPPPPAVKIVSDKGPP 
LFFFFLKS FFAPAGGQWGVFQL I S SGD P 
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Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=G!utamine, R=Arginine, S=Serine, 
T-Threonine, V=VaIine, 
W=Tryptophan, Y— Tyrosine, 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPPAPESVRIPGVSFCAGVPQGFFFFFL 
NSSVAQAGVQWRDLGSLQPLPPRSKRVS 
CLSHPSSWDYSH 


5281 


19182 


A 


5320 


377 


287 


ELLT S S DLiPALAS Q SAG I TS MSHHTQ PG 
SF 


5282 


19183 


A 


5321 


381 


65 


SSSPSSRTSGPFFFFSPPKKKKNFQKKK 
NFLYPPFFFPPPFYKNPPKKKLKGPKKK 
KKISPPPPKKNLSIiKNPPPFFFFFFFFF 
FFFFF 


5283 


19184 


A 


5322 


132 


396 


EGGFVLKIIREGIGPHFLGLEEEGSKFH 
EHI FWEKHRES FPKQGP I PHFMELGTWG 
LSKNPYFRVKQRVEHIEGFKNFFNEKKD 
FLKE 


5284 


19185 


A 


5323 


366 


1 


GFLKKGVPFFPPGKRGGGNQTPPKPFVL, 
GGKQPPPPPPLKKKNPKGGPPQQKKIFF 
FLTRGGVSLFGPGGGFP FFFQP PKKP P P 
GGFPFFFLKKPPPPPLFFFFFFFFETGF 
CSCRPGRTRG 


5285 


19186 


A 


5324 


131 


1 


DGALSPRLEWCSGTIIAHCSLRLPGSSA 
SCVSASRVAGITGVC 


5286 


19187 


A 


5325 


112 


1 


DGILLLLPRLESNGAISAHCNLHLPSTS 
ESTASAFLS 


5287 


19188 


A 


5326 


404 


18 


P VS PHGKNHE KHT VTCVGS TS GAFLHGE 
EHCHGNTHHPFEPSNPQTACQSQANRIR 
HGNS FKNLWRS WAWWRTP WSATQDAEM 
GRPLEPRGLSLAWATQQDPSLKQKQKQK 
KPTHLWSYILGHHQLP 


5288 


19189 


A 


5327 


109 


3 


KCVLTPVIRALWEAETGGSRGQDIETIL 
SDTVKPR 


5289 


19190 


A 


5328 


1 


117 


ETGFCCVAQSGLQLLDSSNLiLAIjVSQSA 
GVTGMSHRSRP 


5290 


19191 


A 


5329 


3 


120 


D AWVAAE AEGQNDT I EE PNKVQKRKRG I 
NDNVPAGQAH 


5291 


19192 


A 


5330 


105 


2 


RSGWAHI CNPSTLGGQGRWITRSGVQD 
QSVQDG 


5292 


19193 


A 


5331 


204 


411 


S KKKQKKKKKKKKKKKKKKKKKQDKKG 


5293 


19194 


A 


5332 


185 


2 


FFFETESHS I TQAG VQWRDLS SLQPLPP 
GFKSLPSSWDYRCLPPCPANFCIFSRDR 
VSLCW 


5294 


19195 


A 


5333 


561 


325 


WSTLHSFPIWLLNTGYYFKELTGFFFFL 
TVALSPRLECSDTI lAHYSLRLLGSRDP 
PTLVFQSTKITSVSHCTQPSVRF 


5295 


19196 


A 


5334 


3 


96 


EGKAAFS QEKSRRVKEENPE I AVSAS T I 
PE 


5296 


19197 


A 


5335 


375 


228 


LSPWLECSGTIIiAHCNLRPLGSSNSPGV 
FGFYFFLTLILGSGVHVQLC 


5297 


19198 


A 


5336 


403 


0 


N I S VPAS KH I S P F P KKKKKKKKKKKKKK 
KKSQKKKSRG 


5298 


19199 


A 


5337 


397 


80 


1 1 ITS KKKKKKKKKKKKKKKKK 


5299 


19200 


A 


5338 


406 


34 


KRTLKDS KKKRKE KKKKKRKKKKKKKNR 
E KKKKP KKKKKKKKRGGAQKKKNNHPAG 
GKAYFFFLGGTKKKRGGGDTKPLGEKNS 
PAQTKKLGGISTWPPFVERGKKKKTPGI 


5300 


19201 


A 


5339 


410 


0 


KKQ KKKKKKD AP GGGGGGRKKKGGP QKK 
KKGSHLKTPKKEKTGGA 


5301 


19202 


A 


5340 


306 


392 


IQYKILPQKENDWDKIQGQLAIPVSSLN 
F 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D— Aspartic Acid, 
E=GJutamic Acid, F-Phenylafanine, 
G=GIycine, H=Histidine, I-Isoleucine, 
K=L,ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W— Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5302 


19203 


A 


5341 


3 


173 


TFTATELI IFYI FFETTLI PTLAI ITRG 
GNQPERLNAGTYFLFYTLVGEGSLLARS 
R 


5303 


19204 


A 


5342 


247 


426 


GRAWWFTPVITPLWEAEAGVQWRDLSSL 
QPLLPGFKRFSCLRLPNNWDYRHVPPRP 
ANFL 


5304 


19205 


A 


5343 


3 


149 


S WFLHFGEAGLELLTSGDP P ALALPKCW 
DYRLVSNSWPEVILQPQPPK 


5305 


19206 


A 


5344 


464 


147 


LRRLRWENRLRPGGGGCSEPRSHRCTLA 
RVTESLSRVQCQRDPRRLAQTSRRPGAS 
S FPFAGPAFSAVNSTYQRALPPAQGAPW 
TPRLPLGPHGTTSSrSSYLPV 


5306 


19207 


A 


5345 


228 


1 


VIRLEETYFLLFFVLFLFLFFRQTRSGS 
FAQAGVQSSHLSLPSSWDHRYTPHLANF 
CI FLQKQSIiAMLPRLVSNTK 


5307 


19208 


A 


5346 


2 


106 


F VLELLGSNNS P S S AS ES AG I TGVSHCG 
QPLFNF 


5308 


19209 


A 


5347 


140 


436 


FLCDMRVLYVTFFFFGKGVSFFPPGWRA 
GGEFGL I EITP PPG FNP FFL LNL P KKGE L 
RAP P PAP I FFFFFFEKRGFTRLTQEGLN 
FWTWEPPPLTLPRGG 


5309 


19210 


A 


5348 


1 


465 


LESRRFEDGTRVPGSTISWDPPEEAVCP 
F S DL QLRAGRT T T L FKAVRQ GH S S LQR I 
LLP F VWQ C P APRGG VYRGRQAS LS C S GL 
HPVRASWLLCFPSEAWAMSGTPPPASLP 
PCNLISDCCASNEQGSVGIGPSEPGVEY 
NLLLRHLLKPEEKRS 


5310 


19211 


A 


5349 


183 


58 


ERDYEEGPGAVACAC I PS TLGGRGGWI T 
RSGDRDHPDEIVDP 


5311 


19212 


A 


5350 


288 


404 


I EXNGM I LAHCNFQLPGS SNS PAS AS P V 
AVITGMNHYAR 


5312 


19213. 


A 


5351 


413 


305 


AEAGLKPLS S TYLRAS AS QS AG I TGVSH 
HTQPLLFF 


5313 


19214 


A 


5352 


91 


356 


T VPKRNTFS P KMVAI TTAGTVS TS VLAL 
GNRESALPTRLGWALCCFIFWGGRGHRV 
PLCHPGWSTVARSRLTATSTSQVQAILL 
PQPP 


5314 


19215 


A 


5353 


2 


410 


FVQWQFLKTGSDRSHKAVESLGYEKVEK 
AMTVNYS FRS LNTKGKLELLGKI SVNLY 
AKWKHFREKGRRYTNTLGPTATAAFYST 
EKTEFLPWEQTCKPFIDSFNKYLLPGA 
ADHACNPSTLGGRGGKITRSGDRD 


5315 


19216 


A 


5354 


3 


288 


SLDDDLLKLLLPLMLQYSDEFVQSAYLS 
RRLAYFCARRLSLLLSDSPNLLAAHSPH 
MM IGPNNS S IGAPS PGP PGPGMS PVQLA 
FSDFLSCAQH 


5316 


19217 


A 


5355 


187 


3 


KKILWQTPGVNLFSFKGAPLFFFFLRRV 
LLCCPGWSAVARSPLTATSASRVQAILL 
PQPPK 


5317 


19218 


A 


5356 


383 


261 


GYKINGQKSQAFLYTNNRQTESRLWYCY 
DSPQRLIHLSIK 


5318 


19219 


A 


5357 


367 


241 


F FFRHVAQAGLQLLGSNNP PAS P SQS VR 
VTGVSHCTQPLCFV 


5319 i 


19220 


A 


5358 


217 


1 


SCDRENTGLKLLSILLLIHCETDTRISL 
LC I S LS I KRE I WLGAVARACNPS TLGGR 
GGR I TRS EDRD PGETP 


5320 


19221 


A 


5359 


255 


412 


TQ L CRKRNAF FFLATNHS Q KG Q S RWGT V 
AHACNPS TLGGRGGR I TRS GDGDH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K>=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5321 


19222 


A 


5360 


391 


271 


RS CH I AQAGIiKL LAS SNP PAS AS QNAE I 
TGVS HHLT P ICC 


5322 


19223 


A 


5361 


232 


397 


EGLKEPAGLFTMEYYTARKRNKIMPFW 
TWMQLEAI ILGESTQEQKTKFCMFSPV 


5323 


19224 


A 


5362 


213 


392 


LFIKVEGKFFLRSINILNESLLFLSSNG 
PGLLKKLGVWPGWAHACNP S TLRGQGG 
RTA 


5324 


19225 


A 


5363 


306 


418 


SDGITGTLPLASQESAWEDLLYVLVGV 
DGRYVSAQP 


5325 


19226 


A 


5364 


347 


48 


KKRWLKPGGGKNPPKKGGKKGGTPPFFQ 
KKGFFFPPFFFFLEKRDLSPQAGGKGEK 
KKPPQPWKAEGKKNPWGNPPQRGGYKGN 
PTRPPPQGFFFFFFL 


5326 


19227 


A 


5365 


412 


172 


PDSNLSFGAHYQASLLGSFSLMPLLLYS 
KCSGQWDVARSAECTSKSSQNKERKKER 
KKERKKERKKERKKTRKQKKNPLT 


5327 


19228 


A 


5366 


400 


296 


LVEGGFHHVGRAGLELPTS SDP PTSAS Q 
NIjGLQA 


5328 


19229 


A 


5367 


162 


1 


APSSLDETPTLLKKTLFGPGLVAGICNP 
STLRGRGGRITRSGVRDQPDQHGETP 


5329 


19230 


A 


5368 


152 


403 


TTVLCKLVQPLWKTVWRFLKELEVDLP F 
NPAVPLLGIYPTEKKSYEKGTCTHVFIA 
AQFQ IARI WNQS KCPP INEQKKKKGAPV 


5330 


19231 


A 


5369 


170 


367 


IVFEILVISYFKSFPQKTTEWNTFFFHP 
FKVFNWPSTVAHACNPSTLGGRGRQITR 
SGDRDHPAR 


5331 


19232 


A 


5370 


396 


1 


QMCVRAHTFAGGRARTHTHTHTHTHTHS 
WATPRVSHNHCVCVCVCVCVC 


5332 


19233 


A 


5371 


139 


3 


PKLECGASIIAHCSIiELLGSKDPPPSAF 
QVAETTGCAPPCLAFFF 


5333 


19234 


A 


5372 


1 


341 

- 


PLINILSHLILLCRNYFSFVKFCIRHTS 
LLYWI SLFHVLLCWS FLTDRRAHNAS 
VTYGFVLPVTFKNS PLPGAVAHACNPST 
LGGRGGQIMRSGVQDQPGQARSSRPALP 
T 


5334 


19235 


A 


5373 


280 


449 


I CHNLQLCPS KDT I EKVKRQP I EWEKIF 
ANHISHKGLSS 


5335 


19236 


A 


5374 


170 


1 


QKRGFSMGTPMVS I FRPQDLTPLGFPKA 
WDPRVNPWGPWPCPLFFKGNPQIiFQAPG 


5336 


19237 


A 


5375 


462 


165 


PPTRFLHWQTGLELPTSGDLPASASQG 
AGITGVSHHAQPRGHVFDVLLEYSQQGV 
FTLLS FLGFS VFFE PLFC 


5337 


19238 


A 


5376 


3 


451 


PRAKLGTRRGLRNLDT I GVAVDLI LLFR 
ELRVEQESLLTAFPCGLQFLRNIASRHE 
DSQSIGWVHAFPPPVWPCLSLPPPHTGS 
S S S ITIi P PLL PHRPHP PPHSHP PMATLS 
RPPYPPPPHPPPHPLPPSLSPPALSPPP 
PPPPPPPSL 


5338 


19239 


A 


5377 


460 


300 


KVLLCCPRLECRGAIMAHHSLELLGSSN 
PPTSASQSAGVTGVSHCRSPWEFIj 


5339 


19240 


A 


5378 


452 


191 


DSRFHHWQADLEHPTSSDLHASASQSA 
VITGVSHRTRPDIAFAPSISSYFSCVAE 
ISLiLHLSESFPKQFL 


5340 


19241 


A 


5379 


240 


395 


LATSSVANLRNTFYLKLKSSVLGVEAHS 
CNPSTLGGQGVRITKSGVRDQPGQ 


5341 


19242 


A 


5380 


210 


411 


TSVNSAPNSGQKASVEECFGQVTKVKIQ 
NVFLGWAQWLTPVI PALWE AGAGGS RDQ 
EIDTILANTVR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylaianine, 
G=G!ycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamtne, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5342 


19243 


A 


5381 


406 


300 


FHYVGQPGLELLTIiGSTCLGLSKCWDYR 
REPPHQA 


5343 


19244 


A 


5382 


364 


479 


INDGISKISLFLSHLKDQAGQVWWLTPV 
IPALWEAEAG 


5344 


19245 


A 


5383 


182 


395 


HHLRGSAASFLKSAQEPTDFRHTGFKRF 
SCLSLPSSWDYRCPPPRRGGGVLCLPGW 
RT WQS WLTTTS I SQ 


5345 


19246 


A 


5384 


37 


478 


GGIHSYPVICLFILMGSSVSLPFSSVAC 
LLAWLLPSFLPSLLPSFLPSFLPSLTGR 
AFL 


5346 


19247 


A 


5385 


201 


458 


WNPPFLSPRKWAARADTEWGVGKTAQEP 
GLSSRLQAYEERQRHWQREREALREDCA 
AYAQCAQRAQQLLQLQVFQLQQEKRQLQ 
DD 


5347 


19248 


A 


5386 


206 


407 


FFSPGGRAGEKFGFMEPPPPGFKKTLWL 
NP PGKWKQRAP P P I PVYFLE FKKKGGWP 
NMVQVGLNPRP 


5348 


19249 


A 


5387 


468 


0 


NFFPPRPAPPFFFFFFFVAQAGVQWRVL 
GSLQAPPPGFTPFSCLSLPSSWDYRRPA 
RVSPHC 


5349 


19250 


A 


5388 


136 


2 


LFFFFLFETGFCSVPRLECSGTI IAHCS 
LKLPGSCNSPTSVSQV 


5350 


19251 


A 


5389 


397 


0 


ILYPSQKKKKKKKKKKKKKKKKKKSQ 


5351 


19252 


A 


5390 


299 


3 


AQPLAPFPGSFILMYFFMLLLFGIPLLY 
MEVI MGQWLHVDNI RVWKQLVPWLC SMS 
YAHS LVNEGP 3 PRP S P I SLCGL.QVS AHR 
VHPQVCASVSLYNS 


5352 


19253 


A 


5391 


264 


202 


FFFFFFFFFFFFFFFFSENKI 


5353 


19254 


A 


5392 


2 


150 


PRVRFL IETE FCHVGQAGLKLLTSVDPP 
TSASQSAGITGVCHLVRPEKS 


5354 


19255 


A 


5393 


382 


0 


NSRAQAWWLMP VI S PHWEAE VGGSLE PT 
S 


5355 


19256 


A 


5394 


395 


239 


SLPSSWDYRHAPQCWANFIFLVETGFSS 
VILCSFRLGGFGLSLNILCSLKSL 


5356 


19257 


A 


5395 


24 


265 


LL PHRKS RN YRTRPTRS LRQKNGMNLGG 
GACSESRSHHCTPVWATERDSVSKKKLF 
TIBL VYVFLKKETS KGEPL PKVLGI* 


5357 


19258 


A 


5396 


174 


118 


FFFFFFFFFFFFFFFFRSS 


5358 


19259 


A 


5397 


406 


134 


PSAPSSPPPPPLIWGPPGFFPPPLFKNP 
PPKFFFGAPKKKI YLPP PRALNF VLLKG 
PPPFFFFFFFFFSFF 


5359 


19260 


A 


5398 


294 


64 


IFFFFFFFFFFFFFFFFWELHILWGALF 
LIjKSR I P PGKKKNTCKI FLCTPIiTG YI P 
LLKCFQKQNKKSQSQNTELKS 


5360 


19261 


A 


5399 


1 


211 


LLLPTPYHHINQDSSNKYHETKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKRGEKKKK 
KKKKGGGGGYKKKR 


5361 


19262 


A 


5400 


305 


1 


SPPRPPFLLPFLFFLPSFFFKAPHTINH 
FFS PPPLFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFLKLYIiFYKMHRINYTSNI 
LKINI FVFSS PGSGGRR 


5362 


19263 


A 


5401 


416 


3 


ENTERDRGRQRREDRSRAETQETNRAGQ 
QESTPPPQKKCNTYSIPSSSSSSPTPQK 
GKKGGPPGGPPPPAGEKGGPKKKKRGPQ 
KKKIRPPPPGKKKPPPKTPPLFFFFFFF 
FFFFFHKPQTFIVGVNTYILLRSVS 


5363 


19264 


A 


5402 


154 


416 


KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
TJSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
K=Lysine, L=Leucine s M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, v =Stop codon, A=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKDSGGGAF 


5364 


19265 


A 


5403 


327 


2 


PPPQKFFLGGVFFFPPFFYNPPPPFFFF 
SPKKKKYFSPPRGKIFFFFFPPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
IjRWS LAL VAQAG VWWLNHGS LQSIj 


5365 


19266 


A 


5404 


372 


181 


YKGLAHTRCS IRNFLNFIIiI IEMGSRFV 
AQAGLEPLGSSDPPTSASQSDGITGVSR 
RTRPINIC 


5366 


19267 


A 


5405 


110 


1 


PVPVSSTEVKKLVLWPGTVVHACNPKTS 
GGQGGRIT 


5367 


19268 


A 


5406 


2 


401 


GFHLTTYGNRKQMQRHVREASARPLQGT 
PPPGHALEWTLEWLLAAPRPRKAQVLF 
AI VAS ETSS WDREKLWTLS LEAKCKG I T 
LFVLALGPGVGTHELAELAELVSAPSEQ 
HLLRLQGVSKPEVNYAQGFTR 


5368 


19269 


A 


5407 


3 


181 


QLILFHDKLSPKLLSWGNVDILINFFLS 
HCS PERARRLQAKMMTNLVMAKDRLQLL 
GIN 


5369 


19270 


A 


5408 


371 


222 


SVEMGFHHVAQPSLEPLGSSDPPASASQ 
S AE I TGVCHC ARPNHDS YRRS 


5370 


19271 


A 


5409 


204 


416 


NQVGEWVLRKGTLGDLIMSMLFHFLWR 
S KDE KNTKI W PGAVAHT CNP S TLGGRGG 
QITRSGVRDQPGRHG 


5371 


19272 


A 


5410 


236 


308 


AGRPGTVTHTCNPSTLGGQGGRIT 


5372 


19273 


A 


5411 


375 


216 


GCSEPRWCHCTTPAWVTVQDSPPKKEKK 
RKE KKKKXjLTT ENS KKAL KRNG YL 


5373 


19274 


A 


5412 


1 


381 


RFHHVAQAGLNIiLGSNDLSAS AS QSAG I 
TGMSPRTQP 


5374 


19275 


A 


5413 


144 


3 


RPPP I FFFLAKKLRALFFFFHS VIQAG V 
QWRNLGSLQPLPPGFKQF 


5375 


19276 


A 


5414 


412 


32 


LFSKKKKKKKKKKKKKKKKKKKK 


5376 


19277 


A 


5415 


414 


2 


NNNSFVFFSKPIFLKTVGGAPPPSPIYN 
P I P I FliY I YP PHKWGGVY I PTHM YGGGD 
IWGYMYTPPPPPLFIYFFPPPPFYIEKK 
KKTVFCFFAQGGVRLCLKKKKNKDRGWG 
D TM TMGHGGRL EM VANVCGL KR Y W 


5377 


19278 


A 


5416 


1 


240 


GGEGCSEPRSCHCTPAWATERDS ISKKK 
KKKGGPFKRTHFSSPGLPSFFFFFGAPK 
FNSRARFLTPREGKNPGLPPFYPA 


5378 


19279 


A 


5417 


2 


63 


LCLKNisrwisnsnsnsTN^ 


5379 


19280 


A 


5418 


322 


1 


WPLCFSRCSKKRGSPLGFPFSPPLGQSK 
KFVRFACKTPSAFKPFPSLS FFQRWAGP 
PPFLQGLGELPKGLFFQQCFFFFFFLRW 
S liTLVAQAGVQWHDPGSLQPPPL 


5380 


19281 


A 


5419 


1 


392 


FRMAGS YPEGAPAI LADKRQQFGSRFLS 
DPARVFHHNAWDNVEWSEEQAAAAERKV 
QENS I QRVCQEKQVDYE INAHKYWNDF Y 
KIHENGFFKDRHWLFTEFPELAPSQNQN 
HLKDWFLENKSEAPSQKK 


5381 


19282 


A 


5420 


273 


91 


RWYIVLDNLIFILENICCETSLKIYLGP 
GWARACNLSTLGGQGGQITRSGVRDQQ 
GQHGE 


5382 


19283 


A 


5421 


1 


149 


GGGGCSE PRLRRCTPAWATERDS VS KKK 
NLKKF F C P GLGL YQ KQNTWTL ' 


5383 


19284 


A ' 


5422 


381 


179 


SSCLGLPKSWDYRCEPLCPARTALNLKT 
I KKS KDNRRWRG CGE KEMLMHCW WE CDM 
VWLCVPTQISC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5384 


19285 


A 


5423 


195 


369 


CSLSFIKE CKLH I SGLQDS S I SLVNLGL 
AIRNIRNKPGWAHSCNPSTLGGQAGWI 
T 


5385 


19286 


A 


5424 


253 


414 


KKKKKKKKKKKKKKKKKXKKKKXKXKKK 
KKKKKXXRGGAAKKKPHSPAPAERDL 


5386 


19287 


A 


5425 


160 


2 


NLKGSAPFSKKTSPFFLVKENPLFFFFF 
BME S LS VAQAG VQ WRDLGS LQ PLP 


5387 


19288 


A 


5426 


53 


426 


ASSSFFFFFFLGKGGSIFPPKGGGEGKN 
FFL WKPGAPGGGGP PP PTPRGGGKKGP P 
PPGQKIFLGMFFEKKRFHHGGGGGFKLR 
GKKKP PPLAPKRVKNKGGNPGGGQPLKF 
KKTLPEKMGLKK 


5388 


19289 


A 


5427 


60 


1 


KWPGT VAHACNP S TLGGRGG 


5389 


19290 


A 


5428 


87 


247 


KKKNPQKKKKKKKKKKKKKKKKKNKKKA 
P AGAGDL S L Y Y P RAGRNNL F FISTYLi P 


5390 


19291 


A 


5429 


209 


405 


KLNLWFSSLSSLKAMISAILLKLSVPI 
FPMKIVILSTLNSGSWPGAVAHACDPSA 
LGGRGGRIT 


5391 


19292 


A 


5430 


324 


237 


VMNAFVCVCVCVCSCICVCVCVCVTFLF 


5392 


19293 


A 


5431 


409 


3 


FFFFFSETESRSVTQAGVQWRDIiGS 


5393 


19294 


A 


5432 


270 


402 


MKT I FHL I R ITG WAQWLTPVTPAL WEAE 
TGGS RGQE FETTVKP C 


5394 


19295 


A 


5433 


107 


277 


YNFNNHR YREA I Q KWDEALQL TPNDATL 
YEMKSQVIIMKNFTFLMKLCYVCSVSEQ 


5395 


19296 


A 


5434 


95 


1 


RNRKNNGQS GT VAHACNPS TLGGRDGW I 
TRS 


5396 


19297 


A 


5435 


412 


105 


GGEGCSELRSCHCPPAWATRARLHLKIK 
KKERWDLAMLLRVGYSGHRQVHCTDHGT 
LQPQPF 


5397 


19298 


A 


5436 


276 


488 


PLPQPNKDSRLLPLLVCLGFLFAPLLML 
GHGP KRARLP I F FPQEAYF I TFRLVFAE 
KKGYRGPLTMCIiAPR 


5398 


19299 


A 


5437 


179 


2 


GISPLKGGTPSLKKKKKAKDFNRRVSKE 
DI QFTNKHMKRCITLL I IKQMQ I KITMR 
YLiP 


5399 


19300 


A 


5438 


3 


156 


TPLYSQILISLRQEHYFKAEGRGCNELT 
SYHCTPAWAKDGDSVSKETKQNT 


5400 


19301 


A 


5439 


47 


173 


SLQLFLQTEERKDSDDEKSDRNRPWWRK 
RFVSAMPKGNFIKY 


5401 


19302 


A 


5440 


3 


122 


TTLTLTSL I PP I LTTLVNPNKKNP P KL I 
RILVLFQTTEK 


5402 


19303 


A 


5441 


390 


489 


KYKSYSLQPPPPKFKHFSCVTLPKHWDY 
RREPP 


5403 


19304 


A 


5442 


86 


275 


RQEFETSQQGETPCLLKIQKLAGRAHLN 
PGGRGCSEPRLCDCTAAWVTRAGLSQIK 
KIKKKKK 


5404 


19305 


A 


5443 


424 


166 


CFFVSPKAKSQGWLGSPPPPPPGLKHF 
SAPTPRGLFLFFFFPRGVPPLGKGGSKS 
PPPGVFPPWPPQKGGDSGGYPPPRAKIF 
LF 


5405 


19306 ; 


A 


5444 


356 


3 


AFFWFTNPFPPGFKGFFCPHPLSGISKG 
GPPPRGNFFFFGKAHFFSPFFKFLPPIF 
LPFLAPQKVGFPGFGPPPKPSQGVFLFF 
FFFFFSFLFFFFFETESSSVAQAGVQWR 
DPDAW 


5406 


19307 


A 


5445 


198 


86 


PWPLGAQAILLPQSSCLNIiPKCWDYRRE 
PPHTALLNS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
D=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine 5 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^VaJine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, A =Stop codon, /=possibJe 
nucleotide deletion, \— possible 
nucleotide insertion 


5407 


19308 


A 


5446 


198 


86 


PWPLGAQAILLPQSSCLNLPKCWDYRRE 
PPHTALLNS 


5408 


19309 


A 


5447 


143 


3 


TMIYFYFYFFLSQSLALLPSLEGNGTIS 
AHCNLRLPGSSDSSADAW 


5409 


19310 


A 


5448 


324 


1 


KKNSATSFFSVEPFSTGGVLANKGALTQ 
KKILPCFKIKKIRKRVGAVAPARNHFLF 
FS PFFKEGLQG I VYFFFFFFELE S RS VT 
QAGVQWRNLCSPQPLPPGFKRTRG 


5410 


19311 


A 


5449 


403 


0 


LFFFFFFFFFFFFPKENH 


5411 


19312 


A 


5450 


114 


409 


LNIWTLKYLISKHLLIVCLPDTIL.SSLC 
LI KYI GLQGPCVLI FFFKQFSAIDKLGQN 
IAWGKFGFAHYSLLTKKWKLFGNITQE 
QNMIVTGGLSWWND 


5412 


19313 


A 


5451 


192 


406 


RNF F LGHLK YVKR F E Q I G WGRD FMF KT P 
KARATKNKMDKGDL I KLKS FCPAKETT F 
RGTGHP T KGE K I FAT 


5413 


19314 


A 


5452 


254 


48 


NAFG WL C VH YAL Q F I D KSVKNS VC WLGA 
VAHACNPSTLGGQGGWITRSGVRHQPGQ 
YGETLRTRASTR 


5414 


19315 


A 


5453 


1 


288 


PLLFILIPSPFSFPHPFCGPIKFILPRE 
KKKKKKKKKKKKKKKKKKGGGPFKKKKI 
FPRGGKKKIFFFWAQKKKIRGGVLKTGG 
RKKPGGKKNKSG 


5415 


19316 


A 


5454 


285 


415 


TSGCCFVFLVLKLS I FCKGKVLNAIEDN 
GLiKNSTFTYFTSDHG 


5416 


19317 


A 


5455 


372 


125 


ITPAFSFSPMGKKGVFPPPKFFGFPPGF 
PPPPFFKPPPGIIFLGPKKKKFFSPPPR 
VYFFFFKRPPPFFFFFFFFFFFFFFF 


5417 


19318 


A 


5456 


3 


159 


AHASAHAYDTIKDALGLLLFILSLMTLT 
LFSPDLLGDQKKKKKKKKKRGGP 


5418 


19319 


A 


5457 


109 


392 


NIIII KRSTSLFLRP I FPPHLFPFILYL 
FISLRTLLFFFFFKREFCFLPQVGGQGP 
FLNSL KP L PRGLMQ F S CLTLLRS GN YG P 
PPPPPINFFF 


5419 


19320 


A 


5458 


243 


402 


F L.K I GF I HKYL KE F VWG ILN I VFNRFNS 
HERAPHAIVFVNGTTIEGHVEKCYW 


5420 


19321 


A 


5459 


394 


105 


IPPPSPRPGGPGAAPPPGGGGSTTPPPK 
KKPPFFPKNKNQPGGGGGAPPSPPPRGL 
GPKIPLTPGTKVFKETKPPPFPPPRGAP 
PKFLSKKKKKET 


5421 


19322 


A 


5460 


56 


379 


VCFSPSIYGAWLNKSPLRSHGSSSTSG 
WGGLPS PFPAQRLPLASQ VQEHNGHVFA 
S YQVS IPQSCEQCLSYI WLMDKALLCSG 
EWLPHQAPKPSMSPTTPHCSRGLQ 


5422 


19323 


A 


5461 


184 


420 


FFFFFLNNNLFFFFFFFFFFFFFFFFLG 
RN 


5423 


19324 


A 


5462 


295 


384 


LGAVAQARWPITLGGRGGWITRSGVQDQ 
PG 


5424 


19325 


A 


5463 


413 


257 


FLNPGGRGCS EPKSHKS TLAWVTERDL I 
S KKKKKE KCW I PQRMPERGQ WS PD 


5425 


19326 


A 


5464 


2 


181 


SliYCSGWSQTPGLKQSSCLSFPKCWDYR 
CELPRPVYVLLYKTSFPKRYVFGIHLCF 
MSQ 


5426 


19327 


A 


5465 


1 


229 


TG S C Y VAQANL E L L S S S D AS T S VS QNAG 
TTGVSHHVQ P YT S S F S Q VP P LRAQ S AAT 
ETTSILWVQKLRLQWRCSV 


5427 


19328 


A 


5466 


1 


368 


KERETKKEREE ERRMRRRREGRKEKGRK 
ERKKEREKERKKREEKKRNNLGQSAGIT 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, l=Isoleucine, 
K—Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y~Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possibIe 
nucleotide insertion 














GMSHHTRPKSHOTLSKFTNLCWAAFKAI 
LGHMWGRLDKLNVEQCFSTELSALFS 1 1 
AT SHMRLWV I 


5428 


19329 


A 


5467 


183 


3 


VSLCYGEKFSPQMNESQLYFRPGAVAHA 
CNP S TLGGRGRQ I TRSGDRNHPGQHGET 
GRV 


5429 


19330 


A 


5468 


1 


128 


GGAGYSELRLCHCTRAWATRAKLHLKKK 
KKKGPLRALKKIGG 


5430 


19331 


A 


5469 


1 


155 


NKFPVLISNVINI IGMCHCTPAWATERD 
SVSKRKKKEKKRSKLARHSVLCL 


5431 


19332 


A 


5470 


113 


1 


FFFFFFQKTSSRLGTVAHACNPSTLGGR 
GGRITRSGD 


5432 


19333 


A 


5471 


407 


281 


NGMIPGGRACSELRSRHCTPAWATEQDS 
I S KKQNKS KNPGL K 


5433 


19334 


A 


5472 


1 


177 


GGGCS E PRSRHC P PAWATERDS YLHKMK 
QSKTKGERMEVACYCII ILFMLTDALEV 
LMA 


5434 


19335 


A 


5473 


76 


267 


KERREEKEKEKRKRSERKGRKKEGRRDR 
EKEKEERKDERREGEEGEEGRKEKKEKD 
IKRRGWP 


5435 


19336 


A 


5474 


133 


2 


LCRMWLFC F E AG S HS VTQAGVQW CGLN 
SLQPLPPGFKQFSCL 


5436 


19337 


A 


5475 


5 


394 


YGVGTKGFRAGVKEFQALFLL I TVHS QL 
CHIEGTRHHSRHI IVWDFINQIENGTF 
PPLVLSNGC I WLL IVYLFWN I FTRQEGQ 
FFYGKLSHYSVFVFCFLRRS FALAQAGV 
RWCDL.GSLQPLPPGFKRF 


5437 


19338 


A 


5476 


163 


2 


SE KP WWALTLMT FSHFYFFIY FYY I F I 
FQMESCSITQAGVQWRDLNSLQPLP 


5438 


19339 


A 


5477 


111 


2 


FLFLKQDLPGAVAHACNPSTLGGRGRQI 
TRSGDRDH 


5439 


19340 


A 


5478 


232 


0 


HLVWRMESGSVAHAAVQWCDLSSLQPPP 
PRFKPFPSS 


5440 


19341 


A 


5479 


388 


225 


ALFSSKKKKKRKKERKRKKEKKKKKESM 
QRKGGHLPRVCCGASMVS PGHKLSL 


5441 


19342 


A 


5480 


150 


2 


HRVAVIYEVFFVCLFFYFLFLRQSLTLV 
TQPGVQWRNLSKLQPPHPGFK 


5442 


19343 


A 


5481 


156 


395 


DSVCTCVL-HTCHSSGLSQGRSCIiHCSLG 
LPNSNCPCVCVCVCVCVCVCVCVCVSFC 
PLHQGVCVCVCVCVCVCVFLSAHFTKGC 
VCVCVCVCVCVCFSPPI 


5443 


19344 


A 


5482 


391 


115 


RACPDCRVRFPHTSPPCLPCGPEAEPGP 
GPALRE L VQ P L PGQLQPP FGM P P S GLGA 
APSPAPPACAWTRPPHLHPSSFSSSVPQ 
ISSLFLCF 


5444 


19345 


A 


5483 


392 


2 


FFFFFSETQFRSCCPRLECNGAISVHCN 
LRLPGSNNSP 


5445 


19346 


A 


5484 


381 


282 


PVPWGQAIFLPPPPPYLGIPGPPPPPGL 
FLFP 


5446 


19347 


A 


5485 


407 


3 


FFFFFFLRRSLTVAQAGVRWCDLGSLQA 
PPPGFT 


5447 


19348 


A 


5486 


2 


417 


QENHLNPGGVGCSEPRSCHCTPAWVTEY 
DKKKKK 


5448 


19349 


A 


5487 


3 


413 


VYLHPSLQPSLSSLVPDQALSHDIWLKK 
KKKKKKKKKKKRGGI S PPS P VGGVGPRE 
TWGVWGPGANLFQRGGGVRELFS QKKKG 
EKGQRNPEKKPPHKRGPGGPKEKFIFFE 
KKKKKGGKP FGKKTKP PPP RVKTP GG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, F=Ph eny la lan ine, 
G=Giycine, H=Histidine, I=lsoleucine, 
K=Lysine, L-Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosme, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nuc)eotide insertion 


5449 


19350 


A 


5488 


229 


411 


IHGPGFKNGGRGKTLGLPNLMADKKI PF 
SPGGYKPKNP 


5450 


19351 


A ' 


5489 


190 


50 


YHHHHHCHHHHHHQCHHRHHHDHHYHGD 
NDYS SHHSRSTCHVAVTV 


5451 


19352 


A 


5490 


615 


402 


RWCLTLSPRLECSGMISAHYNLHLPGSK 
QLSCLSLPSSWDYRCPPPGKTTFSRGIR 
DALKPRKPKRINLiI 


5452 


19353 


A 


5491 


338 


53 


KGPPFFFFCKKGGLFCSNWFIjTPGGKWF 
SPPTPPRSGGKRVPPEGGGKGFSLFFFV 
EGGS CHI AQAGLKLLGSSDLVTSASQSA 
GIMGKSPHAWP 


5453 


19354 


A 


5492 


266 


412 


NPNFFERKSCSVSQAGVHRRELNSLKAA 
PSGYTPASCVSLPSSWDYRRP 


5454 


19355 


A 


5493 


285 


412 


RDGVSALSPGLECGGMI IAHCNLELLGS 
SDPSALiASQSTWNY 


5455 


19356 


A 


5494 


1 


1350 


MGFMTGICIEDNWGPSPESGSVLTVGG 
VRIQMLDRCHTAHCPVI PGLLRGQLWTE 
NLAERNSHDRRPFPTWCSHFAQDLWPE 
QS IKDS YQKVILRKFEKCGHGNLHFKKG 
CE S VDE CKLHKRG YNGLNQ CLTTTQ SKI 
FQ CG KY VKVFHQ F S NS KRHKRRHTE KKP 
LKY I EGD KAFNQ S S THTTHKK I DTGE KP 
YKCEECGKAFNRSSHLTTHKITHTREKP 
YKCEECGKVFKYFSSFTTHKKIHSGEKP 
YICEECGKAFMYPYTLTTHKI IHTGEQP 
YKCKECDKAFNHPATLS SHKKIHTGEKP 
YTCDKCGKAFISSSILSKHEKIHTGEKP 
YKCEECGKAFTRSSHLTMHKI IHTGEKP 
YKCEE CGKAFTWS AGLHKHRRTHTGE KP 
YKCEECGKAYTTSSNLTEHKTTHTGEKP 
YKC KE CGKAFNWS S DLNKHKR I HI GQKP 
RT 


5456 


19357 


A 


5495 


4 


146 


PPTRPTCYYLS FYVS YPTG YKEDTRMVN 
KHMKRCSTSLVYLLWINFY 


5457 


19358 


A 


5496 


5 


388 


CGCFFFS PPPKKKVFPPPFFFFAPGFFP 
PPLFFTPPPQFFFWAPKKKKNSPPPPGK 
Kir r LiKGPPPir 


5458 


19359 


A 


5497 


231 


163 


FFFFFFFFFFFFFFFFXXTNKWF 


5459 


19360 


A 


5498 


163 


2 


SVKFTLGKVSQFLKKHTPAGWAHTVIPA 
LWEAEAGGSRGQE IETILTNTVKPH 


5460 


19361 


A 


5499 


229 


388 


RTHLQAS S DL F C L YS LKTGS C C VAYAG V 
QRHDLSSLQPPPPRFKRSSCLRPPE 


5461 


19362 


A 


5500 


405 


289 


H VAQAQL KLIAS S D P PT PAS Q S AGVT S V 
SHCTHPRCLFV 


5462 


19363 


A 


5501 


286 


375 


GDFVRFSFWPGGGAHSCNPSTLGGRGGQ 
VT 


5463 


19364 


A 


5502 


20 


374 


I IiQFQNFWI VLFKNF YHFGKFL I YILNW 
FSGLFWYYFSELSCFSLSSKINILNLLS 
GILNVFFSISGVRKKIFLIKRYCWLGTV 
THACEPSTLGGQGRQITRSGDRDHPGQR 

V VJS.J.S»i-W 


5464 


19365 


A 


5503 


1 


185 


FHH I G QAGL I LLT S SNLPAS AS QKAG I T 
GMSHCTQP III FLYSLFLQLSFSWQDFV 
MRKKF 


5465 


19366 


A 


5504 


202 


402 


NSFSSTAKLREKYRQFLYTPIPTGTQPL 
PLIiTFCTRPDTVAHACNHS TLGGQGGQI 
TRSGACNQPGQ 


5466 


19367 


A 


5505 


118 


7 


I KKENNWPGAVAHACNPS I LGGRDGR I T 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidirie, I=Isoleucine, 
K=0Lysine, L-Leucine, M-Methionine, 
N=Asparagine, P^ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Un known, *=Stop codon, /=possibie 
nucleotide deletion, V=possibIe 
nucleotide insertion 














RSGDRDHPG 


5467 


19368 


A 


5506 


205 


422 


ANHFTS IRMAI INNNKQTENNKCWQGQG 
SVGTLVHCWWECKMVWMLRQ 1 VWSFLKK 
LNIQLSRDPATLLLAI 


5468 


19369 


A 


5507 


3 


425 


RSCHCTPAWATERDSVSQKKKKKKKKK 


5469 


19370 


A 


5508 


3 


86 


S KKLQVLLNCMTE I Y YQFKKDKAERS K 


5470 


19371 


A 


5509 


395 


108 


GVLEOTDTITAHCSLDLVGSSDPPTSAS 
QVTGTTGTCHHAQLGLPKCWDDRHEPRLi 
PAWNKTLRRKNWHS LLPWPAI QEIiE VG 
GSLRPRSLRLE 


5471 


19372 


A 


5510 


421 


59 


FFFFFFLRRSLAVS QAGVQWWDLGSLQA 
LQA 


5472 


19373 


A 


5511 


168 


3 


EICKMCFPVPFGVFFDWIVFVFVFVFVL 
RWRMESLSVSQAGVQWCNLSSLQPPP 


5473 


19374 


A 


5512 


446 


149 


FFFFFFFCFFETESCSVARLESSGMISA 
HSSHNLCI 


5474 


19375 


A 


5513 


74 


398 


DRI IDAGP KGNYARFMNHCCQ P3STCETQK 
WS VNGDTRVGLFALS D IKAGKNHFR I LQ 
LTSEFQGFCFLQTASSDYNFKHFYMYRS 
LELSS FTFLCNHHHHPS PELFS S S 


5475 


19376 


A 


5514 


73 


335 


EGGS LF C P GWSAGALF WL TAPS AS RGWG 
GAPGVPWGEVGQKNCWNPGGGGCNELG 
LHHC I P P WGTGKTLS P KKKKKKGATDMG 
GAS 


5476 


19377 


A 


5515 


416 


0 


WLGPGAVAHTCNPTTLGGRGRWITRSRD 


5477 


19378 


A 


5516 


242 


423 


AGSPLIHQGGQAI ILTQNPTPGWGTMVF 
QPVLRPVKGLYNANHVTSSPVASQPIIF 
.TTAG 


5478 


19379 


A 


5517 


2 


151 


GRVGFVGMGSHCVAQAGLELLGSGGPPT 
SASQSAI IAGVSHRAWPYFPS 


5479 


19380 


A 


5518 


312 


405 


GQVRPGAVAHACS PNTLGGRGRI TRSGV 
PDQ 


5480 


19381 


A 


5519 


3 


387 


ACSGPRSCHCTPAWATHQDSVSKKKRKK 
KREKP FLLE KKRG P TLT WGE PKKG I PLG 
ERNGPPSHFKKGPPLAPKPPGGNLHPEI 
AQGPGS PTDGNKKGDLNPTPVF YKFKTP 
EQAINWAAPPFPSEK 


5481 


19382 


A 


5520 


292 


140 


KKGPPRARPGGGGSPPLFFFFFFFFFFF 
FFFFFWESTLKKRELHRSDLCV 


5482 


19383 


A 


5521 


1 


399 


KKWLFLSSAQIPTKLSRRRRNRETWPKE 
RSKINLHKPILKKQRHELPDKEFKIALI 
KML YE L KVTMHEQNEN* I NKE TENTKKKK 
KKKENPPLFGTGKNQSPPEKFQMGGQPP 
PPPPQKKKPPPPNRFFVFSPP 


5483 


19384 


A 


5523 


316 ! 


399 


MLLYWPGTVAYACNPSTLGGRGGRITRL 


5484' 


19385 


A 


5524 


400 


0 


FFFFFFFCFFFFFLFFFS 


5485 


19386 


A 


5525 


3 


429 


TTNIGRAMGATTSIVGSDTSQAERPGGT 
TWSPGASSTSQSSRPGTSVTPDSSASEf 
SETVTTKEFSGTTAI SRTSHTGTPAASG 
GQATGSLTATTGVAPGTTVAPGSSNTEA 
TTSVGEKKKDQKKKKASSP 


5486 


19387 


A 


5526 


135 


3 


GGKICTLIKQSVIQESWPGTVAHACNTS 
TLGGRGGWIMKSGDR 


5487 


19388 


A 


5527 


105 


5 


KKKKKKKS GRWPGAVADACNPNI LGGQG 
GQITI 


5488 


19389 


A 


5528 


411 j 


31 


GQRVPLSRGVAKKKKFLWNPGGGAGKNG 
GKNFLKKKPPGAPPPSGGPKKGPGPPPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
VSSN 
09/535,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
Q— Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














KKPL I LVS PP PRKGGAHNMHFRQKKSGN 
KGGEGSTKGGKEKI FLGKFVQKNFFFFF 
DPEFHSCCLGWSAMP 


5489 


19390 


A 


5529 


411 


142 


PRFSFLFSPPRKKEVFPPPKILGGPRFF 
P P PR FIiKP P PHF F FWGPKKKKKF P P P P G 
EKFFFFKGAPPPFFFFFFFFFFFFFFF 


5490 


19391 


A 


5530 


174 


401 


KKKKKKKKKKKKKKKKASPGGGGKKKKK 
ERGGGGERKNI I WVPLLFPP 


5491 


19392 


A 


5531 


410 


0 


QPQS KKKKKKKKKKKKKKKKKKKKKKAS 
Q KKKKKKKKKKKKKKS S GGGF F KE V 


5492 


19393 


A 


5532 


383 


145 


GGFFFNPPAKKGSFPQPHNLGGPPVFSP 
PPVFKTPPRNIFFGAPKKKKFFPPPGAK 
KYFFKKAPPPFFFFFFFFFFFFF 


5493 


19394 


A 


5533 


418 


328 


KTGFRHVGQAGLKLLASSDLPALASQSA 
GL 


5494 


19395 


A 


5534 


169 


3 


IILIIHPCSHVPRRETLGLKACTFKNLF 
WPGMVAHACNP S TS GGRERR I TRS GDR 


5495 


19396 


A 


5535 


1 


254 


PTRPCKPPQTSPNSGKLSPSSNGCMNTL 
HISS TNT VGEVIEALLKKFLVTESPAKF 
ALYKRCHREDQVYACKLSDREHPLYLHD 


5496 


19397 


A 


5536 


43 


404 


QGGLTVLPRLVLNSWPDKRSSRLGLPEC 
WDYRYEPPHQAN 


5497 


19398 


A 


5538 


2 


212 


KENHLNPGGGGCSELRSCRCTPAWATER 
NSVSKKKKKN"FRGGGLiKGKGPGFYPLEK 
KFPGVGFFPKTNLV 


5498 


19399 


A 


5539 


2 


387 


I FQEHKNCGEMSEIEAKVKYVKLARSLR 
TYGVSFFLVKEKMKGKNKLVPRLLGITK 
DSVMRVDEKTKEVLQEWPLTTVKRWAAS 
PKSFTLDFGEYQESYYSVQTTEGEQISQ 
L I AGY I D I I LKKKKKKD 


5499 


19400 


A 


5540 


317 


407 


GLSHHVFFEEGKLKKPRIKDKDKKVPEP 
DN 


5500 


19401 


A 


5541 


120 


3 


NFFFFFLRRSFALSPRLECSGTISAHCN 
LHLPGSSDSP 


5501 


19402 


A 


5542 


430 


0 


QKAQSNG PE KQE KRGVI QNFKRTLS KKK 


5502 


19403 


A 


5543 


172 


2 


F I FN FLH I F S FW FIT KFGC F I VL FLRQ S 
LALLPRPGVQWRDLGSLQPPPPRVRPRV 


5503 


19404 


A 


5544 


149 


2 


EKDIQELNSALHQADLIDIYRTLHRKST 
FFSAPHRTYSKIDHIVGSKAL 


5504 


19405 


A 


5545 


84 


276 


TLKKRPKEIRNQGGSPQRKRAQLGKLQQ 
KY P VLN F KAP FFGERGD Y YKS F F KT CLD 
NLPRRGKG 


5505 


19406 


A 


5546 


2 


914 


RPAWHEEQKSKNEPEDTEDKEGCQMDKE 
PS AVKKKPKPTNP VE I KEELKS TS PASE 
KADPGAVKDKASPEPEKDFSEKAKPSPH 
PIKDKLKGKDETDSPTVHLGLDSDSESE 
LVIDLGEDHSGREGRKCsTKKEPKEPSPKQ 
DWGKTPPSTTVGSHSPPETPVLTRSSA 
QTS AAGATATTS TS S TVT VTAPAPAATG 
S P VKKQR P LLP KE TAPAVQRWWNS S S K 
FQTSSQKWYMQDSSSSSSSSSPQNQQQQ 
PQS SQGTRYHTRQAVKAVQQKE I TQS PS 
TSTITLVTSTQSSPLVTSSGSMST 


5506 


19407 


A 


5547 


2 


244 


TRL I F VFLVE TGFRRVG Q AGL KLLT S G Y 
VGTSASQRMTAHLCLPRCWDYRCEQLHP 
AQRFFKRMLSVAVDAALLHTSGGM 


5507 


19408 


A 


5548 


406 


1 


FFFFFSETESCSISQARVQWCNLSSLKP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5508 


19409 


A 


5549 


417 


43 


IRPARKGCSSGLFGAPLAGGGAPPPPPP 
RQTS P P PPHTKGGGGGPHKP P P BCKKR P P 
PKKKGGGKKNP P KKKKKNTPP P PKS AP P 
PAAKTDPPP PLT VP PFL P P YI FTKKI P F 
LKSYFFFFY 


5509 


19410 


A 


5550 


3 


136 


QPRFPTLTNINENNPTLLNP I KRIAAGS 
LFAGFLITNNISYHSP 


5510 


19411 


A 


5551 


75 


194 


NKELSRNKKKKKKKKKKKKKKKKKRGGG 
LKKKNGGGAKT 


5511 


19412 


A 


5552 


343 


493 


1 1 FS FFFFFFLKGS FVFAPQAGGHGGNF 
GPLKPPPLGLKEFSCLTLLSSW 


5512 


19413 


A 


5553 


148 


3 


DAVSLFCPGSLEHSDTI I AHYS PDLLGS 
SSLSASTSRVAGITGICHHA 


5513 


19414 


A 


5554 


407 


252 


KKKFLKKLAGQGGMRLQSLLLRRLRQED 
HSSSGVQSCSELWGGHLIYSKNR 


5514 


19415 


A 


5555 


3 


231 


HE FVFLVETRFHHVGRAGIiELL ISSYPL 
AS AFRE AKARREDLLS P G VKD KPGQ YRD 
TLTLKKKKKSFFIYLGECW 


5515 


19416 


A 


5556 


3 


131 


SVWSWEESKDAKELVGYYIEASVAGSG 
KWEPCNNNPVKGSR 


5516 


19417 


A 


5557 


3 


131 


SVWSWEESKDAKELVGYYIEASVAGSG 
KWE P CNNNPVKGSR 


5517 


19418 


A 


5558 


462 


255 


RSHHSAQAGLELQASSDPPALTSQSAGV 
TNVSLRAQPILACSSVEVSSRCQVCLHV 
RRGKCPRAESYR 


5518 


19419 


A 


5559 


445 


29 


KSSNILILQKVENGDLSNKILKITDFGL 
ARE WHRTTKMS AAGT YAWMAPE VIRAS M 
FS KGS DVWS YGVLLWELLTGE VP FRG I D 
GIoAVAYGVAMNKIALiP I PS TCP E PFAKL 
MEDCWNPDPHSRPSFTNILASCQAWR 


5519 


19420 


A 


5560 


225 


475 


GAVE S S CTEEHMW I PEKAVEGS LTPTPE 
SRSVAHAGVQWHNVSSLQPPPPGFRRFS 
CLSLPSSWDYTSAGEGCAASYRSKVLK 


5520 


19421 


A 


5561 


248 


142 


NKNI WVQKKCVF Y I YLPF I C VCVCVC VC 
VCVCVCV 


5521 


19422 


A 


5562 


262 


456 


NFPLKTTYFLKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKAS S P P P L I KRGPKN 


5522 


19423 


A 


5563 


314 


444 


LKSMFCFVLFCFFE I ESRS VAQAGVQWC 
HLSPLQPPPPGFKRS 


5523 


19424 


A 


5564 


220 


486 


LNSSHPESPLPLFGLVLQFFYPSNPKSF 
NKKNLEIRMLLTVDRGPNHPDSLFFVHS 
NVDT VFLPQEGPS FLQ PLGQG I TEAFEH 
NQAYV 


5524 


19425 


A 


5565 


176 


492 


CTKTAVAS KAR YLQALL YALAS VT I AAT 
ATRLAS I SGAS PIFRS FPYLQPTAPGLV 
LPS I L ILVDVGLRKEKGPDTVAHACGPG 
T LGGRGGR I TR S G VRDRPDRH 


5525 


19426 


A 


5567 


24 


470 


PMDXKXPXXXETPLNGGAGENQDDMFAK 
LKEKLFNEINKIPLPPWALIAIAGVAGL 
LLLTCCFC I CKKCCCKKKKNKKE KGKGM 
KNAMNMKDMKGGQLPQDDDDAKTGLTEG 
EGEGEEEKEPENLGKLQFSLDYDFQANQ 
LTVGVLQAA 


5526 


19427 


A 


5568 


45 


485 


GXDGRLLRGHNQYAYDGLDYIALNEDLS 
CWTAADTVAQITQRKWEAAREAEQLrAY 
LEGLCVEWLRTHLENGKETLQRADPPKT 
HVTHHPVS DHEATLRCWALGF YPAE I TL 
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TWQRDGEDQTHDTELVETRPAGDRTFQK 
WAAVEVP 


5527 


19428 


A 


5569 


221 


422 


FFFLTGRVRQSLADLEQKIEVHSMPPFE 
S LAL S P TL YCRGA I LGAT S AS QVAG I TN 
ICHHAQPSYLF 


5528 


19429 


A 


5570 


401 


477 


I F S RDKD VAML PR I VL.NS WAQ VI LL 


5529 


19430 


A 


5571 


102 


36 


E VENNDD I SHHHHHHHHHHST 


5530 


19431 


A 


5572 


106 


407 


AGRQG W LEGCRP AAWP TL LTP P ATS S R 
GASRPPTATCPTPDPAPATTTWQPWPPR 
PLWAGVQWHDLGSSQPPPPGIKQFSCLS 
CPSSWDYRGRRAVCHH 


5531 


19432 


A 


5574 


253 


363 


YHSNQCVGLRKWPGAVVHACNPSTLGGR 
GGWIMRSGA 


5532 


19433 


A 


5575 


1 


132 


MECHYVAQAGLELPGSSDLPASASQSTG 
ITGVSHCAWPGFFFFF 


5533 


19434 


A 


5576 


18 


378 


TPGGGGC I EPRWHHCTPAWATRAKLSYK 
KKKKKKK 


5534 


19435 


A 


5577 


21 


232 


ILRCTTIETLLKVQDNHLNPGGGSCSEP 
RSHHCTPTWARERDSVSKKKKKGGAALW 
DPGGGQPSAGSQTR 


5535 


19436 


A 


5578 


377 


2 


liSL I QT I TLCLGA I TTL FAAVCAVTQND 
I KKI E AFS TS S Q I GLI I VT I G INQPHLA 
FLH I CTHAFFKAX LFMCSGS I IHNLNISrE 
QDIRKIGGLLKTIPLTSTSLTIGSLALA 
GIPFLTGFYRTRG 


5536 


19437 


A 


5579 


2 


227 


I ES WLE PVRFLRS MFANNLWD TSDSDV 
YHLLKDLEEGIQTLMGRLEDGSRRTGQI 
LKQT YS KFDTNSHNHVDAA 


5537 


19438 


A 


5580 


113 


3 


NFP F FFFLRQS LAVAQAGVQWCDLGSLQ 
PPPPWFKQ 


5538 


19439 


A 


5581 


184 


3 


IMPPHSRLSPTARFCLQKKTNNNNTPIE 
MASIQCPGWSQTPGL.KPPSCFVLPKCWD 
YRCE 


5539 


19440 


A 


5582 


381 


3 


FFFFLGDRAPPGGEKNPLGGGLKTPGKP 
PGGWGVSPPPQKKKMPPPNFFPPGKKKS 
FGGGGP I FFPGKNF P P PTQ KKKRGGGGF 
FFFFKKPGGFFFFFFPFFFFFSSFFLGK 
SFIKISFKKRRRG 


5540 


19441 


A 


5583 


314 


389 


FSCLSBLSSWDYRCVLPHLGNFLYF 


5541 


19442 


A 


5584 


131 


1 


FPLWTSEQGVGRNKQTYVTWQADCKENA 
GGDYYWTFFPQPTFV 


5542 


19443 


A 


5585 


484 


3 


MTE I LLKQAMVG I VGS I GS AI GGAVGGG 
ASASGGTAIQAAAAKFHFATGGFTGTGG 
KYEPAG I VHRGE FVFTKEATSR I GVGNL 
YRLMRG YATGG YVGT P GS MAD S R S QAS G 
TFEQNNHVVINNDGTNGQIGPAALKAVY 
DMARKGARDS WS AGDQ I FHH 


5543 


19444 


A 


5586 


136 


48 


NTMQ PG WTHTCNP S TLGGQGGR I TGSG 
R 


5544 


19445 


A 


5587 


306 


479 


DS FFPGVGSAFL YQ VAAWTTKYFKKRV 
ALi S TD I ARS E MRL T FLLAPF T KF L I D VC 
DW 


5545 


19446 


A 


5588 


201 


2 


ILETGFHHVAQAGLNLLASSNLPASTSQ 
SAGITGVNHHAWPSFLIQGTKHSFKNSK 
PVFHIKLSKN 


5546 


19447 


A 


5589 


30 


477 


TPSLARGVYPCWLYTGFLHDFLFSPKIE 
ERFMDVGVLRATEPSNSIiFAI IYSTPIE 
LAIKSWHIIGDGLS IELVAVERGNRTTI 
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G^Glycine, H=Histidine, I=Isoleucine, 
KNLysine, LHLeucine, M=Methionine, 
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ISRLPEFEKSSLSDQSLVTLTSGYFAVY 
RVKLTAKKLEG I HEGAFQ I TTD YE ILT I 
PVKARNALC 


5547 


19448 


A 


5590 


208 


91 


KGHFWVTGKKKVPCPPPRFFFFFFLDGI 
SLCYPGWSAW 


5548 


19449 


A 


5591 


325 


467 


ADLCSPCDILQLDFRHIRKTVDTLLALG 
EKAPPPTSALRSRDLISFCF 


5549 


J 9450 


A 


5592 


332 


403 


LSLCVXHTHTHTHTHTHTREIFIY 


5550 


19451 


A 


5593 


292 


26 


FLLDMEGAEAGAELGLYHCLLCAWETPS 
RLkVLQHLRTPAHRDAQAQRRLQLLQNG 
PTTEEGLAALQS I LS FSHGQLRTPGEEE 
GTGE 


5551 


19452 


A 


5594 


387 


51 


LKKI FLGRGQGVS L F FS PPLGGPGGRFL 
GliE I GAL FGL PGKTP PFPK I KKN YPGL W 
GGALFPLPLEGLGGKMAFPRKGGAPFSQ 
KNP P P P PLGGKTKP P PPLKKKKKKKVL 


5552 


19453 


A 


5595 


88 


362 


AETS PGQGS YQDGLRRPGFAEGP PAGRN 
PRRKKTQTDRRGGSAGSQPQACADGVRL 
CTRLQCSGAISAHCNLCLPGSSDCPASA 
SRVAGIT 


5553 


19454 


A 


5596 


92 


366 


DQPWPGQLPGRSPEAGIRGGSTSRKKPQ 
E EENSDRS PGRQRG I PASGVRGRCAALY 
QAAVQWCHLGSLQPLPPGFKRLSCLSLP 
SSWDYSH 


5554 


19455 


A 


5597 


3 


138 


TRPRTRGVGQAGLELLTSSDPPTLASQS 
AGI TG VS HRARPQ S S S 


5555 


19456 


A 


5598 


150 


3 


S C VI YS WMKFFS FLRWS FTL VTQAG vq w 
CDLSSLQPPPPRFKLFSCPG 


5556 


19457 


A 


5599 


148 


3 


TQHLNLAKKSQ ITPGMVAHS VAHAGRQW 
CDLGSPKPLPPGLNRASCLP 


5557 


19458 


A 


5600 


195 


1 


MVYFKKFCGFFLFVRSRIACSVAQVGMQ 
WCNHSSLQPRTPGLKQSSHFSFPSRWNK 
RHMQPCPGA 


5558 


19459 


A 


5601 


228 


415 


VNVS I PKDVFLLI I I ILFFFGPAPQAGG 
RGCNLNSLLPWPPGLKQFSCLTLPRSWN 
YGLPPP 


5559 


19460 


A 


5602 


405 


2 


AATGVDRHAQRTQAGLPLKTPQTPCPNF 
TS KARGTPSAQHP S CTRQRHQASNPTQR 
RRRRPQAEHS I RTD AHG PGL P AS GHMAG 
ERPTRKAQSRDVSVLRQRLALLPRLECT 
S AI SAHCS QNLTGSGNPPVS CL 


5560 


19461 


A 


5603 


3 


236 


RQENGMN P GGGAC S E P R S RHC T P AWAT E 
RNSVS KKKG I KI KGHCS S AMHHQ YLS P F 
GWRQVCSHCVCPKPWLLVRGH 


5561 


19462 


A 


5604 


184 


2 


RP FLR I QGKI RMFSGS RFI FNFL ILETW 
SHSVPQARVQWCHRSSLQPKPLVFKSSS 
YLLP 


5562 


19463 


A 


5605 


261 


2 


GWFLKGPPLFLGPKKPGGLGFFQKKLVL 
QKKNPPFGIGPPKNLKKTPLFFFFFLRR 
SLALSPRLECSGTISAHCNLRIPRTSNS 


5563 


19464 


A 


5606 


134 


2 


GS PFKNLVFFLGQSLTLSPRLECSGTTS 
AHCKLRLPGPHQSPAS 


5564 


19465 


A 


5607 


3 


269 


RQESHSNPGGGGCGEPRSRHCTPAWLAT 
ERDSVSKKKKRKRNFLNFIGKKKKPCLP 
KPKLTKKKKWGGITALVLKLYKRAIAPK 
TGGY 


5565 


19466 


A 


5608 


36 


380 


P P FXXP P GGE TFGGT PRGP P P KKKKS PR 
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KRGGGGKKKP PRGAPP P KHLLGGKNRGG 
GTPPPPRKTPPLNPKKKRGGGKRGVFFP 
PKGGIFQKRGFFFSRGGGGGAPPPHPPP 
GGG 


5566 


19467 


A 


5609 


3 


411 


SSRSRAAADFYFFFFSRRGQKEKEPFLG 
GKTPPVFVFFPGGVGKKNRGFPPPPPAG 
KGPFSGGGGREQTP I WS PPPPPRGKWGK 
PPKPVTPNFNFLGRVGFLGKNPSPGGVC 
PPPKTPIFFFGRPPPGGFFWGGGG 


5567 


19468 


A 


5610 


364 


483 


ISWVliFFLFLKWS F VL VAQAGE QWRGLG 
SLQPLPPGLKGF 


5568 


19469 


A 


5611 


399 


285 


GGCSEPKSRPCTPAWGTERDFVSKKKNK 
NNYLVKKSN 


5569 


19470 


A 


5612 


408 


3 


KE VFS FP PGLEHGG VF YS L P PL PPR VKG 
FSPPPPPGGGGPRGPPPPPGYFLLFPKK 
GGFPPLPGGFSSPQIWPPRPPQKVGVPG 
GAPRPGPNFFFFFFRESFPFLARGFPKK 
KKKTC WNVTT SHS AS VQS S VLL 


5570 


19471 


A 


5613 


1 


112 


LAQHVFFIiLNTS IAS PADS SELLMEVHG 
NGKRPSPER 


5571 


19472 


A 


5614 


2 


389 


FVREGERKKEKRKKEKKRKEGTKEKGKE 
EGRKE RKKE KVRKKT VD DRRR P Q VT F Q A 
VNLE S EGAPGN CGVTAP PGGH I WLKQVE 
VQTHLENRCHRLKATVTTTTRPGAEPRS 
PPFIITCGEIMNVFSDM 


5572 


19473 


A 


5615 


385 


3 


KPPPGDPGKTPFFLKSQKITGGGGGAPL 
FPPPQKVKAKKRGYPRRQSFKRARLGPC 
P PS RGGKKKAG F KTNP P T KKKNQT KKPM 
KAKTS KLLQENRSGPGT VAHACNPS TLG 
GQGGWIMRSGDRDTK 


5573 


19474 


A 


5616 


218 


402 


RSYS FSQL VGTVI KARKPESS CWLCAPN 
RKNAFWPGAVAHTCNPSTLGGRGGRIMR 
SGDGD 


5574 


19475 


A 


5617 


2 


235 


KSFGCASRLQMHGRTHTGEKPYKCKQCG 
KAFGCPSNIiRRHGRTHTGBKPYKCNQCG 
KVFR C S S QLQ VHGRAHC I D T P 


5575 


19476 


A 


5618 


347 


421 


I YLXXKKKKKKKKKKKKKKQKKKKK 


5576 


19477 


A 


5619 


92 


1 


ENLALLPRLECNGTILAHCNLRLPGSSD 
SP 


5577 


19478 


A 


5620 


3 


402 


APPPSFFFKKGLIMAISPTLGAIKLINF 
L FF F KKGKKKKKP PLE KKKKKKKKKGGA 
PLKKKKFSPPGGGRKIFFKGAPKKISRA 
R VKKQRRGE KTRGNP KKKRG KKKRPF PN 
TVSR 


5578 


19479 


A 


5621 


420 


3 


HPPPQGSANFFPPPRKIGINGPPPPAGI 
IFVFLECRGFPQVGQGGFKIRPPNKPPP 
PPPKNLGLQKKSPPPPPIFFFFFLWRWG 
LMTLPRLVSNSWLKQSSHLGLLKHWDYM 
S E APHLAKNFLKE YE WS L S FHVEGS 


5579 


19480 


A 


5622 


246 


132 


EVAPHLLGCISPTMCVCLCVCVCVCVCV 
CVCVCLCSW 


5580 


19481 


A 


5623 


219 


424 


I LNVMVLG I GTFGRLLGLEGRALTNG I K 
AIj I RRDMG P GAVAHT CNPN I LGGREGW I 
MRSGDGDHPGQH 


5581 


19482 


A 


5624 


32 


454 


GL E KVH S MTXDRNS RL S VQLRDWDGGGE 
LLQFSVHLDGEDTAYSLQLTAPVAGQLG 
ATTVP P S GLS VP FS T WDQDHDLRRDKNC 
AKSLSGGWWFGTCSHSNLNGQYFRSIPQ 
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QRQKLKKGIFWKTWRGRYYPLQATTMLI 
Q 


5582 


19483 


A 


5625 


166 


2 


LFGLVKLTALKGLKMILPSQVWWLTPVI 
PTIi WE AKAGGS QGQEFE TS LANTVKP 


5583 


19484 


A 


5626 


120 


406 


GEAATQENLAELRPEPELLSPSTVLSRE 
PELPSPSTVLSREPELPSPSTVLSRKPD 
LLSPSTVLSRKPDLLSPSTVLSRKPDLL 
SPSTVLSRKPDLLSPSTVLSRKPDLLSP 
STVLSRKPD 


5584 


19485 


A 


5627 


324 


16 


LSPPPPLFKTAPRPQKKNPPPKKKKPAP 
PKKKNLFFFFFFFFLVFFFIWCSWRLAE 
WNETLGECDMRWGQAVGTGTSRELRPGG 
SRSQPRRLAGKELGPHCAP 


5585 


19486 


A 


5628 


378 


72 


GRGGERPPPPQKSGAKKKPGVLNFFPQR 
GKRSGPPGPPGFFKIFSQEAPGFFSSFF 
WGRSIiALFPSWGARGGFLVPANLGPPAG 
APKGNPPSKKKKEKIED 


5586 


19487 


A 


5629 


3 


204 


QESCLNMGGRGCSEPRSCHCTPAWATEK 
DS VS KKKKKW KFGG F YE V* I GNFLHL FP K 
WSYPPSFRIF 


5587 


19488 


A 


5630 


177 


1 


SFHEMRFWKLIFI IRIYVHFERRSCPVP 
QSRVQWHDLGSLQPLPLGFKRFSCLGLL 
SNW 


5588 


19489 


A 


5631 


154 


3 


KWFQMGSRSVTQAGARQCDHSSLRPPTP 
QLKRSSCLSFPGGWDYRHVPPC 


5589 


19490 


A 


5632 


239 


17 


EGVRNYLILQPRSLCRLNCSFFPNREKE 
LCHHS S F S S PLAAAKE SQGE KGRLL S QD 
EGLLLWEVFVEDVEINS 


5590 


19491 


A 


5633 


2 


110 


GCS E PRSHHC I PAWVTEGDS VS KKQP P P 
QKKKCIAG 


5591 


19492 


A 


5634 


433 


0 


FFFFFFFFFFLKKI PPPEGGGF 


5592 


19493 


A 


5635 


306 


433 


FSIS IERSSSGRAQLLTPVMPALWEAEA 
GRSRSQEIKTILAN 


5593 


19494 


A 


5636 


2 


285 


Q SRT S V I FNS E KAE RGEE T TE E KFE AS R 
G WLVKFKERS CLHN I KVQSEAI SADVEA 
AASDPEDLiAKI IDEDGYTKQHI FGQALC 
LMSNPTTLGG 


5594 


19495 


A 


5637 


160 


1 


L YCGK I YWKF I RLISTFFL S QGLTLS P KL 
ECTGTITAHCSLKLISPSNPPARAA 


5595 


19496 


A 


5638 


3 


261 


KIQLNPGSQGCSGXiSLCHCSPAWATETL 
SQKKKKKNPPFGFKIKGEIEPLREFGTK 
KKKGGFLKKFP PHPLGFFLE P FTWGGF W 
N 


5596 


19497 


A 


5639 


2 


121 


S ELRS CHCTPAWRQS KTTSHKKKKN I KK 
KNKFFLPFREN 


5597 


19498 


A 


5640 


370 


180 


VEMEFCHVDQSGLELLISGDTPTSASHN 
AGITGVSHRTWQPSFFSFFKERRNLKSV 
TSLNVSP 


5598 


19499 


A 


5641 


254 


1 


FY I LRKVF VFL F P AVS YGKGT Y F AVDAS- 
YSAKDTYSKPDSNGRKHMYWRVLTGVF 
TKGRAGLVTPPPKTTPHNPTDLFDSVTNN 


5599 


19500 


A 


5642 


292 


106 


NFVILFYLFKQGSHYVAQAGLEPLSSSD 
PliATASQRAGITSVSHRAQTKQAErYW 
PFSLFT 


5600 


19501 


A 


5643 


3 


100 


DTFFEVEVEiyiEKEVCRDLMCTSPKDEEG 
FIjLI 


5601 


19502 


A 


5644 


1 


282 


E KVPE AKRL YGKRGD P F YE AQENHNL I G 
VANVFLECLFCDVKLQYAVPI ISQQGEV 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q-Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *-Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














STGQPWRTMWQVFCFGLFWFS FYLSYV 
LEVDQETKAN 


5602 


19503 


A 


5645 


142 


67 


NKNRPGTVAHACYPNTLGGQGGQIT 


5603 


19504 


A 


5646 


350 


213 


GLELLTSSDPPASASQSAGITSMSHRPW 
PPLASRLDLCCGTLYLI 


5604 


19505 


A 


5647 


3 


180 


EHNAE RAFLAQAHLGPGAP CTPGFPSFI 
PTQFPPFRSSGLPNIPVQTISRAAAEKL 
FG 


5605 


19506 


A 


5648 


366 


1 


RSWFTTYKERSLLHIIKVQREAASADIE 
TAASYSEDLAKITHESGYPKQQIFNVDE 
TAF YWKMI PLS T C I AREEKS I SGFKAS K 
DRL T LWLGANA I GD I KLKPM I VYHF KT P 
RALKNYASSL 


5606 


19507 


A 


5649 


1 


355 


QIQSSQSGKGLKLTSFIDEETKLFFFGN 
KVWFCI PPGPGGKLCFKKKKI FGLYPGG 
GGFWGPQTPPLAPTFPPGFPPPPGPNFS 
PRGPFPKMGPALRGPGKKKKPPEKTKPA 
KGGTGP 


5607 


19508 


A 


5650 


348 


124 


RAPPVFFWVFLGQFPKKKSTNSPRGGGP 
PRSFQPFSLPPFFRNVFKKKPPPFFFEG 
GRGPPAPPFKNPFFFFSRL 


5608 


19509 


A 


5651 


194 


43 


KKKFFFFFYEMESHPVTQAAVQWHHLGS 
LQPPTPTFERFSCLiTLSLIAIP 


5609 


19510 


A 


5652 


27 


244 


KKQQSTLFLFYFLSFILFFFLVKTGSHY 
AAQAGLE I/RAS SNPNTS VS L P KS WD YGH 
EHTVPSPTWFLAAKTP 


5610 


19511 


A 


5653 


38 


429 


GLFLSCLFFFFLEKGVPLTPPGATGENP 
AP KKKGTGQKLL KNRDF F LKKAM FGKQ K 
QKKFFFAPTI TGDQRAPRVFKGPGAQKE 
PFAQKKKERGGGKKTRAAPNKKKKPGAF 
FSLKKTPLWAGGGPPPFF 


5611 


19512 


A 


5654 


391 


68 


NLYPHGGVPGVSPGKGSLPKAPNPLGGI 
PPSGEPPPLjPPQGFFPCLKPKKRLGKAP 
GPFFFFFIAILLGWFQISNTDS CVQVFI 
IQTIWII FTT I I KGCHLHFTFLF 


5612 


19513 


A 


5655 


159 


34 


GARFLFFSPVLSPLIEHCVCVCVCVCVC 
VCVCVCVCVLCACMCT 


5613 


19514 


A 


5656 


88 


388 


SIFCFIYVFFLRQSFTLVAQAGVQWRDL 
GPLQSPPQKKKGKETFSFLGENRYLQPK 
NFPTFLTQVCFFEKIPPNGLINGKRPKW 
IPGCEFLKTLWGPW 


5614 


19515 


A 


5657 


176 


371 


HFYFCFSDINLAAEPKVNRGKAGVKRSA 
AEM YGS VTEHPS PS PLLRSGTLL FITAL 
CPSVGIFSF 


5615 


19516 


A 


5658 


3 


113 


CSEPRSRHCTPAWATERDS VS KKQQTKN 
LSYKKIHW 


5616 


19517 


A 


5659 


409 


67 


SSSNKPPTRPVFILPFFGHGVPRLPPGK 
KPPGPPPGPPPPQVEQMYGRKGGFALNL 
PPRRQNKNMLYSPELGEK 


5617 


19518 


A 


5660 


2 


377 


GRIS CSRNTPLRHRSIML I VCGLRMPLS 
GFLP WVGQVFLAL I PVNPQPS S LGVRT I 
MDMLRCCRDFIYGQFKTGQFMARLWGACS 
QQYTEFFRFKIIFISSLLFNIVLEVLAR 
AIRQEKEIiMGMLQ 


5618 


19519 


A 


5661 


256 


381 


NFNTGQAWWLMPVI LAIWEAEAGSSRGQ 
E IET I IANTVKPHL 


5619 


19520 


A 


5662 


19 


313 


LYSSIAILGFLPLL INFI I I I I FFFFER 
DPGQQNQNFVSLKKRETPPPPFSLGVPG 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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NO: in 
USSN 
09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaJanine, 
G-Glycine, H=H\stidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X—Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














KRSLLPPPPGPKHAGPPVLPAGIiWGGPE 
PPTWGRGAKQKREI 


5620 


19521 


A 


5663 


78 


365 


E TGRKRKE TRWNL KKVCLL KM S LQGE KE 
ERKREKYRRTEMKRQKNSEKQRGREKGK 
KS KYQRRRKKE KKGENQ T KP KP KVEE S S 
SRQ 


5621 


19522 


A 


5664 


159 


28 


DTSTLPTPTGFKRFFCLSLPSKWDYRRP 
PICLPNFFFFLWTGY 


5622 


19523 


A 


5665 


350 


151 


IRCLCPDKFLAFCRDGVLLCCSGWSRTP 
GLKRSPCPSLPKSWDHRLCLADIYIYIF 
LKDTSTTMSF 


5623 


19524 


A 


5666 


394 


248 


LTPGGGGCGKLRIiRHCTPAWATRANLPL 
KTKKKRKRFLFTLAETSKVY 


5624 


19525 


A 


5667 


1 


194 


RLNPGGRGCSELRSRHC I PAWATRTKLH 
L KKKKKGDQG FRGDHTKN I WL F PRVFL P 
LHSLNTLK 


5625 


19526 


A 


5668 


2 


312 


RGQNSARGI 1 1 IAI LKKVCVY I YRCGLT 
LSPKLKCRGTI IAPCSLELMGSCDPSTS 
ASQIARTADMHHHTRLIIASLTFCLLKK 
KKNVFCGTYNHCPPKGFYL 


5626 


19527 


A 


5669 


1 


245 


RLNPGGGGCGEPRSSHCTPVWATRVKLS 
LKKKKKKRPPFFFFGIRQPPQGGPFQTL 
TPLNLLKNLTTIiLGAKKFFFFTGGK 


5627 


19528 


A 


5670 


1 


135 


ARTPSPSLSESSEDEKPTKKHKKGKALR 
LKRRFWWLMSALPCIH 


5628 


19529 


A 


5671 


97 


2 


KKFTHVWPGWAHACSPSTLGGRGGRIT 
RLN 


5629 


19530 


A 


5672 


362 


173 


VFSPPPPFKTPPPPPKKIPPPKKKKTPP 
PKKKKLFFFFFFFFFFFFFFGTRIVIDN 
ILSGNRV 


5630 


19531 


A 


5673 


237 


13 


SPGGGGGFAPYWMFFLDFERQSCSVAQA 
G VQWHNL S SLQPSPPRFYFFS RDGVKNR 
PVPRYAPPSPKGISLRPYK 


5631 


19532 


A 


5674 


245 


14 


ANFLKAHIVSTLGLQI IWSLLQIiLNFAI 
VF YFLRQS FALVAQAGVQWRD I GS LQPL 
PPRFKRFAGQRFSWEAEVAVS 


5632 


19533 


A 


5675 


2 


249 


SHPTHSHPQHLPLTPTWNWSSTPVDFIF 
RKAPPVFPHWQHHRAGPGTSLIPTDAPQ 
EASPPPPYPSSPSSTPTIPRETFLRQ 


5633 


19534 


A 


5676 


115 


241 


SSTLGGQGS ITLAQELRSCHCTPAWATG 
QDSISKKKKTLLGP 


5634 


19535 


A 


5677 


460 


2 


RLGVTGRWQKLKPQERMRAWRENGKGPK 
ALi PAS WEGP TVNLL WKHEHRAW VS CHCD 
SGFCAWCQGLGSEQSGLPSLIPSWDGLV 
LTSPPNPTGPPPPLKTRSVLRGDDVLLP 
CDQPSNLtARALWLLNGSMGLSDGQGGYR 
VGVDGLLVTDAQ 


5635 


19536 


A 


5678 


2 


346 


ENCLNSGGEGCS KLRLCHCTPAWVTERD 
S I TKEKKKKNFKP VLE KFGE I FFLKGWK 
FFQLNP LKS SKEME FFNTTLPLE P CFNG 

TSTVDTPTaTTI "A XT T Ti7T>T ■N.TTTTP/^T'^VO O T XT' VTH/~* On 

JN is-PJf Wi\AJS.X W X LiiN Jf J? U 1 yj\Gb J. r KWGGR 
KK 


5636 


19537 


A 


5679 


181 


4 


P KE VRT L FGRF F F F FHME SHS WQ AGVQ 
WCNFSSPQPPPPGFQKFSCLSLPISWNS 
RHM 


5637 


19538 


A 


5680 


549 


410 


F F VQTD F CHVAQAGLQ LLGS SNP PAS AS 
PKYWDYRREPVCWPSSGF 


5638 


19539 


A 


5681 


111 


403 


SRQINGFFWSRTKVAIGQFLVGPSVKI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 
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NO: in 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I^lsoleucine, 
K=Lysine, L=Leucine, MKEVIethionine, 
N=Asparagine t P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FENNNEICMI KS I QS Y I VLNHKHLKVYS 
SNIWQLKNILLRGGAVAHACMPS ILGGR 
GRRITRSRDRDHP 


5639 


19540 


A 


5682 


189 


3 


lSnSTNRKASWKKKDRVLLYCPGCNWCDLGS 
LQPLPPGFKRFSCLKLPKCWDYKHEPLH 
PAPHV 


5640 


19541 


A 


5683 


2 


227 


IHVCVCVRVCVC I LREDS AQPARCVCVC 
VCVCVCVCVCLDPRRELSPTSQLESS CH 
PLYVPSGGSRINRPYPGGC 


5641 


19542 


A 


5684 


318 


421 


DVPCFPVAFCCDCKFPEASPAMLNCESM 
KPLSFI 


5642 


19543 


A 


5685 


213 


398 


SLCCLLFCCCYFETESRSWQAGVQWHD 
HS S PQPGP PRLKS SSHLS LWRHDHS S PQ 
PGPPRIi 


5643 


19544 


A 


5686 


398 


1 


S FP TS S FYS S S VQGGGQGAKS YKSCTAP 
TAQPHDRPLTASSSLAPGQRVQNLHAYQ 
SGRLSYDQQQQQQQQQQQQQQALQSRHH 
AQETLHYQNLAKYQHYGQQGQGYCQPDA 
AVRTPEQYYQTFSPSSSHRE 


5644 


19545 


A 


5687 


2 


288 


SDRPRSPSFSFPSSPAGVGRASSIFPIL 
SAI LiLLLGG VC VAASRVYKS KRNI I LGA 
GI LFVAAGERQREGATGAAHLGAHVCVC 
VCVCVRARARA 


5645 


19546 


A 


5688 


314 


2 


CRPGS QLQELGDGGGGE S FVGGHGQQHP 
GAPQRGRYLQPLLMCGASPHASPRQPLA 
ILAAGVKFRVLRHEAGRPLRLLMQINPL 
P YS RWHTYRLPS CGCL TC I 


5646 


19547 


A 


5689 


122 


3 


MGGGEFSKKjynsriNGQGWAHACNPSTLG 
GRGRQITRSGV 


5647 


19548 


A 


5690 


144 


1 


E AVKKYT C P T WAGAVAHACNP S TLGGRG 
G W I T R S RGRDNNG KHGE I PR 


5648 


19549 


A 


5691 


377 


65 


LGVFP P FFFLRGEGLS P P P PGGBVKNQF 
FFPPGQKGPPGTPERGFQGFLGPKFFPP 
LP PQGPKG I FFLKKKKKKKI P I CALAVL 
TYQHFCYRTGQERVDGLRELi 


5649 


19550 


A 


5692 


12 


147 


YE RR F CH I GQ AGLE LiLT S GN P PAS AS RG 
AE ITGVS YHALMRNIS 


5650 


19551 


A 


5693 


183 


395 


GHRFGFLHPPPPGFGGFFRPPPPKTEGK 
RAPPPPPRFFFFLGGGGFSGGGPRGVLF 
LEHENPP PGAP kgvg 


5651 


19552 


A 


5694 


177 


283 


THFLNRTDGRR WS IASL P S S GYGTNTP S 
STVSVSA 


5652 


19553 


A 


5695 


266 


460 


LSQRKYAFFFFFFFFGKKIFFSSPGGEI 
RAQFFLTEPSPFWVKGFSPLPLPKTWDY 
IGRDPPPLG 


5653 


19554 


A 


5696 


1 


412 


MQPLKVNLLEQKNLGLEFVAPFTWIHCP 
SYWS IAYNVYFSQYSFIWVFCTMFITVU 
IPS KKKKKKKKKKKKKKKKKKS S S P G S P 
GYAPKKAPKNKKKK 


5654 


19555 


A 


5697 


379 


0 


FFFPSQKKKKKKKKKKKK 


5655 


19556 


A 


5698 


1 


187 


PTRPPTRFRFENGVNLGGGGCGEPRLCH 
CTPAWVTERDS VS KKKKKKKKAVGGP WF 
FQKKKV 


5656 


19557 


A 


5699 


374 


0 


DFLLQKKKKKKKKKKKKKKKKKKKKRGG 


5657 


19558 


A 


5700 


219 


413 


PFCPQGGGGGPNFVFLGPPPPGLKEIPP 
P P P RGGGKKKP P P P P RGGF F F F KKKGVS 
PGGPGGSPP 


5658 


19559 


A 


5701 


381 


249 


FHRAGQDGIiDLIiAPGSAPLGLPKCWDYR 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, F— Phenylalanine, 
G=Glycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S—Serine 7 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














REPPRPAFFFFFFF 


5659 


19560 


A 


5702 


382 


197 


LANFCIFSRDGILPCCPGWSGTPGLKQS 
AHLCLPKCWDYTHKPPCPAPQVIKPSIN 
ILIVN 


5660 


19561 


A 


5703 


1 


381 


ERGS KHPPRGLKH I RGR T C F VLKGCM I Q 
LCWPPGQGKKKKEGGGEKKKDKKREEE 
GRE KKKNGNQGERVKKECKS KRKKKKRS S 
SS 


5661 


19562 


A 


5704 


291 


167 


NGPPFFFFFLVKMGFCHVGQAGLEFLTP 
GDPPALASRLLGL 


5662 


19563 


A 


5705 


367 


23 


KNPCPPPPPEFFLAPKKKGPPPRKKTRF 
VPPPLFLIKGPPFKRGKGPPPGGFPPLF 
C P LGG KRGG VF L S P RVGKP RGKGGNP LF 
PKKKKKKKRRRRKKEGKKGRKERQGYIH 
TV 


5663 


19564 


A 


5706 


411 


116 


FLKRWGSHYAAQAGLELLDSSDLPVSAS 
QSAGTTGVTHCSRLSYSALACQLCQPVW 
RI P WRGRGVGRE T S FRG WGRR YGRAW S S 
SGGGAESLPGLSPG 


5664 


19565 


A 


5707 


162 


2 


SYCFPNSDDFIPFQPLPHTWFLWPRPFF 
I FKTAHGWARWLTPVI PALWEAEAG 


5665 


19566 


A 


5708 


1 


416 


LPLLYKIPPPPAILFLFIKDSPPPPPPP 
PFFFFFFPPP 


5666 


19567 


A 


5709 


403 


142 


LSKEKKWGWPRGFSPRIFEKPPPPKXFF 
APPKKKIFPPPPGEKKFFFKGAPPPFFF 
FFFSFFFFFFDCYNSSTWNSAWHIRCSI 
SIC 


5667 


19568 


A 


5710 


339 


461 


PSTFILFI LRRS FALVAQGGVRWRHLGS 
PQPPPPGFKRVPA 


5668 


19569 


A 


5711 


408 


42 


TGARI I FSGPPKKTYPSPPRAFI ILS FK 
RAAPQL I YFYLNLFAYKRLVKLTTVIMR 
RR PTFFLiLLF KT FL F VE PG F CHVAQAGL 
QFLGSSTCGSLPQCWDYRCEPQHLAKRS 
TFIKSESAV 


5669 


19570 


A 


5712 


277 


9 


gvcvcvggwvppmglqtpsapwglsla 
pplgtlcsvqwlaesihlcicqalve.pl 
rrql yq vp vrklllass i vsgfgds iwn 

GSYLR 


5670 


19571 


A 


5713 


411 


0 


FLQILLFLIDIAFFLVFFFFFWKK 


5671 


19572 


A 


5714 


205 


400 


HYLIQSSSSMASESVQLDWMKIPLTWP 
GAVAHTCSPSTLGGRGGRITRSGDGDHP 
GQHDEAPSL 


5672 


19573 


A 


5715 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5673 


19574 


A 


5716 


176 


392 


GVHLVLEWKLSMVCAEDDELFPLCSWKK 
FFFFLFLRQSRSVTQAGVQGRNFGSLQP 
LPPGLKRISCLTRPSS 


5674 


19575 


A 


5717 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5675 


19576 


A 


5718 


306 


42 


GS GLRR CALNS VKCLNTLE C I LLGI YLR 
QIKICVYTKTCTCKFIVTLFIIAKKYEY 
PSVHQLINKMWHIQI IFLVIKRNEGCGG 
SCL 


5676 


19577 


A 


5719 


1 


263 


PTRPGRFFLILGDRGCSELKSCVCTPAW 
VTEGDSISKQNKKQVELRSPAAHLLGGL 
VPNRPQASTRIGDPCTKKLLELISKFSK 
AAA 


5677 


19578 


A 


5720 


3 


278 


NQEILKNVESSRTVQPHFLEFLLSLGWS 
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Predict- 
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nucle- 
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last amino 
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peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lrysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possibfe 
nucleotide insertion 














VDVGRHPGWTGHVSTSWS INCCDDGEGS 
QQEEVISSEDIGASIFNGQKKSMHIIKQ 
IKYFLSH 


5678 


19579 


A 


5721 


127 


3 


KKKKKKE EE EQGEGRRRRRKKKKE KE KK 
EKE EDGE GKE KGE 


5679 


19580 


A 


5722 


204 


96 


VTSRSSLFSTPGIKRASCLGLPKCWDYR 
PQPPHLA 


5680 


19581 


A 


5723 


3 


261 


QIVRQWKYNDDDDDQLFYTRVYLDPGLR 
VGKAEEWAHPAALWEWRPQAEWDLGSKG 
GC KE KNVCLG WAHACH PRLLGGIjGRR I 
A 


5681 


19582 


A 


5724 


134 


358 


PPPPPPPXLLFFVSAVPPSPSPSLSFPP 
HRPCYSPPSPTLSSLPPPLLFPPPPSSP 
PPPPRPSAPSPPPSSPFFS 


5682 


19583 


A 


5726 


248 


470 


LLNVQNGIRNVCSCPHYTPFHKWKYLS 
VFLTE VDSSQLRRQLCGGS QAAI ERMIH 
FGRE KVQ I LHLAL I YFLV 


5683 


19584 


A 


5727 


1 


281 


NQHNRG FFDSCDGF FTN YNWRE EHLE RM 
LGQAGE RRAD VYLG VDVF ARWNVVGGR F 
DTDKGGGGFLSKGQRPSAS PGTS FPHGP 
SIPVCTSKE 


5684 


19585 


A 


5728 


460 


3 


SQTRTLTLDREITIjPSSGTALISLVDGS 
GNPVSVEVQSVTDGVKVKVSRVPDGVAE 
YSVWELKLPTLRQRL.FRCVS IREMDDGT 

YAI tavqhvpeke ai vdngahfdgeqs g 
tvngvtppavqhltaevtadsgeyqvla 
rwdtpkggpvtv 


5685 


19586 


A 


5729 


70 


465 


sqcsphhsvlcqkvqdlesgtlqsnwkk 
itpwdenvcvcvcvcvcpcvsmcvyvcv 
yvclcvcvcvtvcmcvcly i kgllflla 
kketagsffhsqltclgvpcaftihpvp 
flpqcrshcvsggrfwpgvl 


56S6 


19587 


A 


5730 


281 


427 


S WL TAAPNFQLQME SRS VAQAGAQ WHDI* 
GSLQAPPPGPQSAETQKQMIV 


5687 


19588 


A 


5731 


418 


194 


TKFHHVGHAGLELVTSGDLLSSASQSAR 
ITDMHHCSWPASVLSMNRPSGSTRCLRN 
GIFGPLKDFKPNDLTVQL 


5688 


19589 


A 


5732 


229 


2 


PGGAPPLVRSGFPGWGGAPPFVFVPNGE 
PPKNFFFFFDGVSIiCRPGWSAVAGSQLT 
ASSTSRVHAILLPQPGIVA 


5689 


19590 


A 


5733 


435 


138 


PPRYPPPLWGAPPHLSRGGGSFTPLTPQ 
GKPRFFLKIKINPGGGPPPYSPPFGGGG 
PRNFFSPGAKGPLVLISPLFFPPGDQKK 
NSFFKKKKKERERE 


5690 


19591 


A 


5734 


113 


409 


KDVCGGVLTGLSGVLTSPEYPNNYPNSM 
E CHWV I RAAG PAHVKtiVF VD FQ VE GNE E 
CTYD YVAVLGGS SPTPVHQYCGS I SAPY 
DFFLAKMIMSYSLPN 


5691 


19592 


A 


5735 


3 


122 


GGWQVQFVSEPCSLS INGVIFGLTSTDL 
LFHLGAEEISR 


5692 


19593 


A 


5736 


14 


428 


FLiALRWGHIRLPASGPRDTATLFSTLDT 
QLLMTLYVGKDETGFYVSKALVHTGVAL 
VPRGLTLAPADGPTTDEVTLQVSGEREG 
SPSTAVRYPSGSVALPSQWLLIGHHELP 
P VLHT TMLR VHP TLGS GTAE TR P PEN 


5693 


19594 


A 


5737 


415 


1 


QHINTLGQFWTTKAFLPRMLELQNGHIV 
CLNSVIiALSAIPGAIDYCTSKASAFAFM 
E S LTLGLLDCPG VS ATTVLP FHTSTEMF 
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QGMRVRFPNLFPPLKPETVARRTVEAVQ 
LTQALLLLPWTMHALVILKS I LPQAA 


5694 


19595 


A 


5738 


335 


152 


GVEMG FHHVVQAGLE FLS S SDP PAS AS Q 
SAGITGVSHQASPFFLPLSGITLPLSIY 
SSGSD 


5695 


19596 


A 


5739 


35 


462 


IDASDFGECSQAAINDNWIVYELLEEM 
VDNGFPLATESNILKBLIKPPTILRSW 
NS I TGS SNAGDTLP TGQLSN I P WRRAGV 
K YTONEA YFDAAEE I DAL I DKSRS TVFA 
EIQGVIDACLKLSGMPDLSLSFMNPTLL 
DD 


5696 


19597 


A 


5740 


429 


325 


NPGGRGCSELRSHPCTPAWVTEQDAILG 
GRSGRIT 


5697 


19598 


A 


5741 


344 


423 


F WPGAVAHACN P S TLGGRGGR I TRSR 


5698 


19599 


A 


5742 


431 


103 


RGPFFPRPRQKKGGSPGGGPPQNKKNGP 
PLHTGGAKGGSFFKKKKKKKVPKKGPGL 
F FPG P QG PR KG PVGAPPVL I PNRLGP PR 
IPPGWEPGRQIFFLAPVFFFPSKPK 


5699 


19600 


A 


5743 


31 


224 


E EMG F YHAEQAD FELL S S1STD PAALiAFQG 
AGIASMSHCSWPGKNFNMFCCIWKMKRL 
AIHWECIA 


5700 


19601 


A 


5744 


'217 


400 


HSVMGGSPSFFFFFFFFFGEKGLFFFCP 
GGPNLGKGNPP P P GKKGFP P P P P P KGGN 
NGAPP 


5701 


19602 


A 


5745 


4 


455 


DGLEFPGRRFRGQRCSSPPRRGRGQVEA 
LLTS QTGRRGRGAPHVS DDGRRGRDAPH 
FLDGL VAGKRRS S L P I WDGGRAE TLLT F 
QTGQPGRGAPHVPDDGRPGREAPHFPDG 
VAAGQRLQS RHFGGQGRRLGGGGC I EPR 
SRHCTPAWAP 


5702 


19603 


A 


5746 


381 


212 


PRGGGGGRYSPPPGGGGRGNSFYPGGGG 
LNKPHFGPCPPPGGPKMKPPPQKKKKSL 


5703 


19604 


A 


5747 


334 


432 


MFCENYKE IAWPGAVAHSCNPNTLGGRG 
RHITC 


5704 


19605 


A 


5748 


190 


1 


IPPKKGGLNFFSPPTPKKGVKSKPPQGK 
VFFFLRRS FTLVTQHGVQWRDLGS PQP P 
APGSKHE 


5705 


19606 


A 


5749 


425 


297 


ESAFHHVAQAGLELLGSCDLPVSASQSV 
GVTGMSHHTRLYCS 


5706 


19607 


A 


5750 


425 


0 


KKKQRKKERKKERKKERKKERKKEKKRT 
KSDE 


5707 


19608 


A 


5751 


136 


3 


AQSSRPAWLIFLFLVETGFHOTGQAGLE 
LLASGDLPTSQVPRRP 


5708 


19609 


A 


5752 


3 


237 


GCSEPRSRHCTPAWVTEGDSVSKTOKQK 
TQRGLKS IALFPAIHSLLPWIKLAMPI 
LAMD I FPHP I VE FS YHVWEQM 


5709 


19610 


A 


5753 


1 


358 


GDRGCS ELKS CHC I PAWVTEQDLPS KKK 
KKKKGGKKTRNGP QTCGGDLP P P PLGGG 
KNFPKGETFWFHSGKKKRFFSLGKGEKC 
FFIPRAPKGPGKTLGGPNRGGGAKKKLK 
RPPGPVY 


5710 


19611 


A 


5754 


2 


191 


QGCSELCSYHCTPAWETRARPVSKKKKP 
KT KNQTKKNRLGKKP PNL I FP P FQKKKK 
WQIPGK 


5711 


19612 


A 


5755 


42 


408 


KHAAP PAS LS LSLLLHHGQKRAC F P FAF 
CRDCQFLEGSPAMLPVQPAKLTPRSTPP 
HP CSLENDSAHFFN 


5712 , 


19613 


A 


5756 


179 


410 


LQAFLRPS PPVLFLL YADLSNQNLWPGM 
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VARACNPTTLGGRGGR I T 


5713 


19614 


A 


5757 


166 


2 


GRPGPADFRVRPQXjLQRFLFIYLFTEME 
SCSVTQAGVQWCNLiGSLQPLP PGLPD 


5714 


19615 


A 


5758 


334 


3 


EEKVFGLPSPILLSKGQGGAASGPPCPI 
S PRAMGPQKKGWE TMVF I TRAP PS FGF V 
LVKQSPGTGPVPRNFFFFLRRSLALSPR 
AGVQ WRDLGS MQAP PPGFTP F S CLNL 


5715 


19616 


A 


5759 


373 


470 


SS CWSLRNS S PGAVAHACNPNTLGGRGG 
RITR 


5716 


19617 


A 


5760 


210 


384 


EALDTTIFFFFFFETKFCFVPQAGGQGQ 
NLGSLGAPPTGLTPFSCLTLRKTWDCGP 
PP 


5717 


19618 


A 


5761 


257 


400 


SLGSEYTWVCFCVFETESRSVGQVGVQW 
CNLSSLQPPPPGFKRFSCLR 


5718 


19619 


A 


5762 


400 


2 


ARAVGS VGEDPGLGTGPS PARE TWE WAV 
VCRACRGSQLLWGKLFILRHWDLRKVLA 
RGNGTEDCLKQLWSQCGAGMGRGPFQGL 
KGKPVGCLCSSREVFYLFLYLFRDRVSL 
CHPGWNAVKQTRLTAASASQ 


5719 


19620 


A 


5763 


34 


440 


RVRAP LGWE E GN YF VLRAKP R S GKRS CL 
PFLRCGHQEPIiLWGISTASPLRHLVGAS 
QICGVLFLCFQVSLKPAGSWKWIGNXTG 
ERKAQRLCPLRATRVEGIFCFRFVFLRW 
S FTL VAQAGVQ WRD LGS L Q P P P L 


5720 


19621 


A 


5764 


297 


468 


LVGAGVPTGCGQRNGWNSHKSVQAQWL 
MPVI PALWEAEAGGSRSQE I ETILANMV 
T 


5721 


19622 


A 


5765 


1 


205 


GFKLTDS FAS GAKAI LRS QLLGRLRQEN 
RWNLGGGGCMGLKWHNCPPAWAIEGNLL 
FKKKKKKGGGTF 


5722 


19623 


A 


5766 


307 


1 


KERDPPPTPFGKTFFFFFFKRNFCFFPP 
KEGQGG I LGPRHP F FSRLS FLGRGVPRG 
FPPGPRNFFFFWILVKMGVPHVGQADFK 
I)L T S GDLNKTAS QNAGI K 


5723 


19624 


A 


5767 


2 


319 


LLGNGEELSRFPFQRSSQAWVFESSAGI 
FGGKAGATGLE KLTGS FQQLTGHPD PRT 
PEE LGD P E KS S AS E KTRGLQ E ANGVP E I 
LKAVTYTQAWPKEAKVDPITP 


5724 


19625 


A 


5768 


787 


1118 


EAAWRELEAERAQLQSQLQREQEELLAR 
RKAEKEQLSEE IAALQQEHDEGLLLAES 
EKQQVREPWRGLCCSLSCSSSGAGPCSA 
T WQLGALG Q ATAL LGAS VS L PAGGE D 


5725 


19626 


A 


5769 


2 


452 


NGAGTMSVSLEADENPFAQGALKSEDCF 
ILDHGKDGKIFVWKGKQAKTEERKAALK 
TASDFITKMDYPKQTQVSVLPEGGETPL 
FKQFFKNWRDPDQTDGLGLTYLSSHIAN 
VERVP FDGGNLDTFTAMAAQHGMDDDGT 
GQKQIWRIEG 


5726 


19627 


A 


5770 


2 


130 


GTQGKIVDLVKELDRDTVFALVNYIFFK 
G KWER P FE VRD WKS 


5727 


19628 


A 


5771 


422 


8 


HIARWGARAPQGGWALLKAELCLNPAPP 
PIGARTAPLWPGCCFSLRPVFLLLFPFF 
SLFIQSFRILYPQPWPLQGRRSTTLSRM 
GTTDHV I VLAS TNRAD I LDGALMRPGRL 
DRHVF I DNPTLQERRKCMRRLGAS RN 


5728 


19629 


A 


5772 


432 


3 


YSLNLKKRWFVLTHNSVHYYKSSEKSAL 
KLGT WINSLCS WPPDEKI FKETEYWN 
VAVYGRKHC YRLYT KLLSEAS RWS S AI Q 
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NVTDTKAPIDTPTQQLI QDLKENCMTSH 
WEQ I YKRKKI LRYTHHP C I AAAVEDHD 
ND 


5729 


19630 


A . 


5773 


20 


432 


SRAAALLE AVTETL F YYE VAE KI WSNRA 
NRQ CADCGS SRPDWAAVNLGWI CKQCA 
GQHRALGSGISKVQSLKLDTSVWSNEIV 
QLF I VLGNDRANRFWAGTLP PGEGLHPD 
ATPGPRGEFISRKYRLGLFRKPHPQ 


5730 


19631 


A 


5774 


433 


2 


RGAFQT LQ KKDLQAF VLS S VQLQ VL TGS 
CF KLRT VHN I P VP SNKVDD E CP GL YG FL 
HVIVHFAKGFKQSATLSCALEVDSFGYF 
VSKAKTRVFRDTAEPKWDEEFE IELEGS 
PSLRILWYEEYWITKSKVNNLYSAAVGFS 
AWQ 


5731 


19632 


A 


5775 


133 


436 


MLISLPHPSTHLTSLPFCADFGTRQTTG 
AASHVliTIiSAHSSVCSCPPLMGEATTSR 
AQDLPADEHTAFAHELEAPALS KGQGLS 
ILRALTGHLPLGQEVN1 


5732 


19633 


A 


5776 


397 


58 


KGT I S I GKLE E KLRGAARQ AL ADAI I E F 
QL L PAS L CTED T P TGMQ VKRS IjGSRE CH 
RS KQD S VAGE RAKVGFRG S T S EN Y Y I RT 
TYSQLSVSLSGHHPGATLAAVNGCSCSP 
T 


5733 


19634 


A 


5777 


1 


324 


MRRCVSAGGVELEEYYPAFLDMVRSMLE 
GSIDPTQYEDTLREMFTIHAYVGFTMDK 
LVQNIARQVSRAGVGPALRTAEMALPSF 
HSPLGRGPWSTANLSCFFLIETK 


5734 


19635 


A 


5778 


1 


398 


RLGSRPSLHDQSPLELRSEIQELKGDVK 
KTVKLFQTEPLCAI QDAEGAIHEVKAAC 
REE I Q SNAVRS ARWL FETRS LD VIHQD P 
SQVRLIRGISLEEGARPDVSATRWIFDT 
QSLDAIREILVDEKDFPPSP 


5735 


19636 


A 


5779 


281 


12 


VACNLGWRVRTPTPASYPPCVPQLSVRG 
KNRKELLGGFLRNIVKSADEALITGMSG 
LKEVDDFFEHERTNLLEYHTLYCGRSED 
HATSR 


5736 


19637 


A 


"5780 


406 


23 


DSQATGPGISRGRRISSPSSLCTRFSTS 
IVKCSPLEHVLKYLDPPPPPTRCSRPAS 
LRS S P KHC VQQQ FP X ACRPGAAAP PP P P 
AQCPNPGPAAASARHEPKWGSRTEGGPA 
RARRREGTGRRKKRR 


5737 


19638 


A 


5781 


415 


18 


TSQAGDDLiLHGSLLRRGPEMGYLPGYDP 
D P TLAT P P AGQTLAVP S LP RATE P GTG P 
LTTAVT PNGVRGAG P S APE L L T P P PGTT 
APPPPSPASPGPPLGPEGGEEEPPPTII 
TTTTVNTTVTSPMYCGRSEG 


5738 


19639 


A 


5782 


168 


432 


GMRRQTGLD QQ S RG VGKNWRVD F F FP S S 
LSFFFFFFGKESYFVLKFGIHGLKFGYIj 
EPLSPGLKEFSRLSPPKGGHLKCRPPSP 
LFFF 


5739 


19640 


A 


5783 


1 


423 


TDDDLNWLDHSRTFREQGVDENETLLLR 
RKFF Y S DQNVD S RD P VQ LNL L YVQARDD 
I LNGSHP VS FEKACE FGGFQAQ I Q FGPH 
VEHKHKPGFLDLKEFLPKEYIKQRGAEK 
RIFQEHKNCEKISERSAKV 


5740 


19641 


A 


5784 


273 


400 


DSWLLMLS PGVCPHVQVWP I SCTLGAT 
FGYVAGLVISPLWI 


5741 


19642 


A 


5785 


445 


28 


DTNRAPNHME I KGSGGTEAKVNGADS ID 
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KEAAGDETMEARTMEAEATGAKPTGVEA 
TGAKVTETKPTGAEVREMESTEEEANME 
S KPTGAQ ATDTE TTG VE AMGVE AT KT KA 
BEAEMQAYGVGAGQAEPPVTGTVLRPL 


5742 


19643 


A 


57S6 


380 


2 


QRQQES P EAS S LH I LERQ VQELQQLL VE 
SQEEKESLGREVESLQSRLSLLENERGN 
TSYDVTTLQDEEGELPDLPGAEVLLSRQ 
LSPSAQEHLASLQEQVAVLTRQNQELME 
KVQ I MENFE KDE C I 


5743 


19644 


A 


5787 


663 


2 


FGVERRATRTIRLTRTVLDLYSFLAGVS 
ENLRHATQDDAS RTRAPGL S S QH P KP D T 
TVSGDTETGQSPGVFNTKESGMKDIKSE 
IAEVKDTLKNKSDKLEELDGKVKGYEGQ 
LRQLQ E AAQG P T VTMT TNE L YQAYVD S K 
XDALREELMEGMDRKLADLKNSCEYKLT 
GLQQQCDDYGSSYLGVIELIGEKETSLR 
KE IMtf LRARLQE P S AQANC CDS VY 


5744 


19645 


A 


5788 


398 


1 


TRC YHS ALRYGGS FAELGPRIGRLQQAQ 
LWNFHTGS CQHRAKVLPPLEQVWNLLHL 
EYKRNYGAKRGGPP VKRAAEP PVVQPVP 
P AAL S G P S GEE GL S P GGKRRRG CNS E QT 
GLPPGLPLPPPPLPPPPPPV 


5745 


19646 


A 


5789 


116 


371 


RHPKIWGLRGPPGLTLEKRGKGEEPLDP 
RSWGFKEAETPPRPFTRGKKGTPRLKKK 
P KERKTRERGGKKEGKKKGKKNEGKTNP 

L, 


5746 


19647 


A 


5790 


406 


284 


RRGFFHVDQAGLELPTSGDPPTLASQSK 
PGPLCPALLHTF 


5747 


19648 


A 


5791 


200 


3 


GDALIYMEPEKQVMSRSSDECVVALCDQ 
WliVS YS KKI PLVNLLTFDLKKNI S I CPL 
PGTWIMEMY 


5748 


19649 


A 


5792 


372 


278 


LQIQSPAGQSRCGGFLVREDFVLTAAHC 
WGR 


5749 


19650 


A 


5793 


1 


167 


VAP P KLD PH SGRVRAQAGL ELLTLGDPP 
ASSSQTAAITGASHHTRQVLGSYKALS 


5750 


19651 


A 


5795 


112 


3 


SSWGKVGAHAGEYGAEALERMFLSFPTT 
KTYFPHCI 


5751 


19652 


A 


5797 


356 


238 


FLFCNNGVYSLEHFRCIiAENAGDDAFVK 
DVTVLQNTDGR 


5752 


19653 


A 


5798 


379 


60 


VKIRHCHTMVSMSLLKNIRLILEEEQVE 
GNYSFCVLDNQNLQQIiWDWDHRSLTIKA 
GKMYFAFNPRLCVSE I YRMEE VTGTKGR 
QSKGDINTRSNGERASCE 


5753 


19654 


A 


5799 


377 


3 


KVAFPGEGRQGESWSAFQGHTRWSDLK 
WAVYEPDLMVTSSVGTYIYIWDIKDTRK 
LT VAL F WAGAS Q VKWNKKNANCLATSH 
DGD VR I WD KRKP S TAVE YLAAHL S K I HG 
LDWHPDSEHNLY 


5754 


19655 


A 


5800 


143 


316 


CPTSPPRLPYTEFFGGVSGLTVEQFRKI 
NGFPNAF WGWGGE DDDLWNR YL CR S FGV 
F 


5755 


19656 


A 


5801 


390 


2 


LFFFFFFFLRQSLTLVPQAGVRWRDLSS 
LQPPPPGFKRFSCLCV 


5756 


19657 


A 


5802 


342 


465 


VIFFFKLKQGLTLSPRLECSGTIITHCG 
LDLLGSSESPTFT 


5757 


19658 


A 


5803 


1 


378 


VYSSEFENIKEEYRGRGFPTICYFEKGR 
FL I Q YDNYG S TAED I VE WLKNP Q P P Q P Q 
VPETPWADEGGSVYHLTDEDFDQFVKEH 



892 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=As parti c Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methiomne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SSVLVMFHAPWCGHCKKMNPEFEKASEA 
LHGEADSSGVLAAV 


5758 


19659 


A 


5804 


152 


1 


KPE I FFFESRSVTQTGVQWCNFGSLQP P 
PSGFKLFSCLSLPSSWDYTAHV 


5759 


19660 


A 


5805 


207 


31 


LYLQLLQQTASSGNLNTLSSLHPMGGKC 
FTAAEEQQHPGQQTQLTTVLGWHRFCE 
NIP 


5760 


19661 


A 


5806 


199 


2 


GSECSCLLSRGGEMHVSLSSFKVTFSCD 
AIIHFSFLFFWRQEFPSVQAGVQWRDLG 
SLQPPPPGV 


5761 


19662 


A 


5807 


385 


239 


QDQPGLQKMCVCVCVCVCVCVCVCVIIC 
KS CIRLS YMKMP INFGLKNL 


5762 


19663 


A 


5808 


39 


184 


GRI I I KFFGEMRSHHAAQTGLELLGSSD 
PPTWASQSAAI TGVS LRIRP 


5763 


19664 


A 


5809 


376 


2 


WAEPTSFHVCITSYTQFFRGLTAFTRVR 
W KCLV I DE MQRVKGMT E RHWEAVF TLQ S 
QQRLLLIDSPLHNTFLELWTMVHFLVPG 
I S RP YL S S P LRAP SEES QD Y YHKWI RL 
HRVTQPFILRSV 


5764 


19665 


A 


5810 


271 


120 


FLCFIGKTSERGSFSLYSRDTGLPGCQV 
SLMIKKSNEMVFKHINKGSINR 


5765 


19666 


A 


5811 


367 


3 


DKS FKE WHKMLEDANAP TEERDER I AGL 
YEEIERDLMLLGASAVEDKLQEGVIETV 
TSLSLANIKIWVLTGDKQETAINTGYAC 
MMLTDDMNDVFVIAGNNAVEVREELRKA 
KQNLFGRCI 


5766 


19667 


A 


5812 


294 


37 


FRGENPDDGVRGS P PED YRLRQVAS SLF 
RGEHHS RGGTGRIiAS LFSSLEPQIQ P V Y 
VPVPKVSHWAFFPNSSLEPLTYKAIFLP 
TA 


5767 


19668 


A 


5813 


184 


3 


I IS YLKNYLLFSPLQRQNFVLLPRLECS 
GT I TAHCNLKLVGS GDP PAPAS Q VARTT 
GMRH 


5768 


19669 


A 


5814 


468 


1 


DDMEYEGSVSVFTQVPRKSWKGLDYTL 
MVAGE FGLGKSTLVNSL FVSDLYRDRKL 
LGAEER I MQT VE I S KHAVD I EE KGVRLR 
LT I VDTPGFGDAVI S TECGKRRAM Y 


5769 


19670 


A 


5815 


24 


314 


QAPFGPKCFNVNLCFSLSLSKGEPEFHY 
IAGAHGNEVLGRELLLLLVQFVCQEYLA 
RNARIVHLVEETRIHVLPSLNPDGYEKA 
YEGVMAIiAHLTQ 


5770 


19671 


A 


5816 


474 


3 


TTLRKGYSRHKGLLHS IRQEVQATPLEG 
IIVSGQMSTMNLSSTLWPSPKRVRRLCI 
GRTLRSLRIRPCKRTPLQTRRMPQKMRA 
LYGDPGDSPPLRSRAIPASSPGSTGSFR 
ARPPHPLHPPPHLTREGEPPLLPVLQPR 
RGPPLPLHSQASRPIV 


5771 


19672 


A 


5817 


511 


389 


GM YCTEGRE WPT FRNE I E I EEDHCGRV 
SASIILTHCSNR 


5772 


19673 


A 


5818 


510 


2 


PLSCEVPNPLFPVPGVRMIQHHIEACAQ 
VRGMQE I VL I GFYQPDE PLTQF I EAAQQ 
E F I LP VRYLQE FAP LGTGGGL YHFRDQ I 
LAGSPEAFFVLNADVCSGFPLSAMLEAH 
RRQRHPFLLLGTTANRTQSLNYGC I VEN 
PQTHE VLHYVE KP S C I AAAI EDQAS SRQ 
R 


5773 | 


19674 


A 


5819 


282 


152 


FSLFLFLLQSERHFIMQWCEATQCPDT 
RVSVRLYEKSVIiHLS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
. correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, K =Stop cod on, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


5774 


19675 


A 


5820 


347 


1 


EPHPVTPPAVGTSRYNPLIMMDRNRSAV 
TPPSHPPQQPSSMQTGMNPSAMQGPSPP 
P P P P S YMH I PRY S TN P I T VT VS QNLP S G 
QT VPRALQ ILPQIP SNLYGSPGS I YIRQ 
TKV 


5775 


19676 


A 


5821 


482 


13 


IYEKLITGCYNILANHADPNSGLDESIL 
EECLQYLEKQLESSQARKAMEEFFSDSG 
ELVQ IMMATANENLSAKFCNRVLKFFTK 
LFQLTEKSPNPSLLHLCGSLAQIiACVEP 
VRLQAWLTRMTTS 


5776 


19677 


A 


5822 


608 


7 


FRFAEKWEGMFI IVNSITIKIHSKAFH 
AS FELWQLQGYS VNPNWQQ SDLRLTR I T 
DPCRGEVLTFKE ITWQTLRIEADATDNG 
DQDPVTTPLRLITNQGRIQIALKRRTKD 
C NVI S S KLMFLLDDLLWVLTDS QLKAMM 
KYAESLSEAMEKSAHQRKSLAPEPVQ I T 
PPAPSAQQSWAQAFGGSQGNSNSSSSPV 
LRPL 


5777 


19678 


A 


5823 


328 


3 


AEVASEDCSLPCFLAVWNRIIEPVAAMR 
KEADMLRLFPEYLKGEELFGLTVHAVLR 
IAESLPGVESCQNYIiFRYGRHPLMELPL 
MINPSGCARSEHKILTHYKRPSLY 


5778 


19679 


A 


5824 


69 


308 


TSVPSCVRCRYIILRTSSALTNILGTHS 
NSSFFHASSSALHSCCFPFFSWLQTLDI 
NVKAPALMT KA WP EMS KRG YRE 


5779 


19680 


A 


5825 


424 


2 


LQRAFS ESI RKLRGYQGADRKQ I YHRRC 
S FANHS VRPS ADEKCNSNFFEQRHGGSH 
QSSKWTPVGPAPSTSQSQKRS'SGIiQSGH 
SSQRTSAGSSSGTNSSGQRHDRESYNNS 
GSSSRKKGQHGSEHSKSRSSSPGKPQTV 


5780 


19681 


A 


5826 


107 


2 


S S LTAGVRMGVPAQS TQGTVNGS S PQMS 
GTAALTS 


5781 


19682 


A 


5827 


2 


437 


FPTEDSRTSKESMSEADRAQKMDGESEE 
EQESVDTGEEEEGGDESDLSSESSIKKK 
FLKRKGMTDSPWIKPARKRRRRSRKKPS 
GALGSESDKSSAGSAEHIGPCDSTGDME 
VSSGLPGSRRPENPVFLCIWVAVRAICP 
GFIaWS 


5782 


19683 


A 


5828 


487 


22 


MPEPVFPLSHFRQFIAAIKLQFQARLSR 
CVRDLVRSLAAPNHDTLRMLFQHLCRVI 
EHGEQNRMSVQSVAIVFGPTLLRPEVEE 
TSMPMTMVFQNRWELILQQCADI FPPH 


5783 


19684 


A 


5829 


2 


871 


RG I CSQRWRREGS Q S RGPGLVI TS P S GS 
LVTTASSAQTFPISAPMIVSALPPGSQA 
LQWPDLSKKVASTLTEEGGGGGGGGGS 
VAPKPPRGRKKKRMLESGLPEMNDPYVL 
SPEDDDDHQKDGKTYRCRMCSLTFYSKS 
EMQIHSKSHTETKPHKCPHCSKTFANSS 
YLAQHIRIHSGAKPYSCNFCEKSFRQLS 
HLQQHTR I HS KMHTETI KPHKCPHCS KT 
FANT&YLAQHLRIHSGAKPYNCSYCQKA 
FRQLSHLQQHTRIHTGDRPYKCAHPGCE 
KAFTQLSNLQ 


5784 


19685 


A 


5830 


463 


1 


LPESRLPSPHKREEGSRARVIMTSYPIE 
PHERRKGSLADWATLKQKKLEEMTRTE 
QEDSSCMEKLLSKDWKEKMERLNTSELL 
GEIKGTPESIiAEKERQLSTMITQLISLR 
EQ LLAAHDE QKKLAAS Q I E KQRQ QMDLA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, ^-Phenylalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
KF=Lysine, L—Leucine, M==Metnionine, 
N— Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan s Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














RQQQEQ I ARHQQLY 


5785 


19686 


A 


5831 


462 


2 


MILFDSS LYFLCGW YLSNL I LGT YGVRK 
PWYFPFTASYWKSVGFLVEKRQYFLSSS 
LFFFNENFDNKGSSLQNRBGELEGSAPG 
VTLVSVTKEYEGHKAWQDLSLTFYRDQ 
I TALLGTN GAGKTT I I SMLTGLHPPTSG 
TIIINGKNLQTVY 


5786 


19687 


A 


5832 


396 


2 


VP S TPAS KRKG I VPRCDIF I YDRGGALP 
KD I HVAG I L FASAWMNWPLLS AI E FHS 
AWAMGMGVNFFVAS THHVSLNMSGSGRY 
APNGRKVYHYDMKTEMGELLLSEVDSHP 
LSSLAYPTADNWNAYATLY 


5787 


19688 


A 


5833 


384 


137 


EE I CLNPGAGGCSE PRLWHCTPAWATKR 
D S S KKKE KNCL S KNKKEKE KE MFAC VC V 
CVL FS I YAFPS KAQE LSNP CKHYGiSTF 


5788 


19689 


A 


5834 


171 


2 


FVEG I N I SGNF YRNKLKYLAFLRKRMNT 
NPSRGPYHFRAPSRI FWRTVRGMLPHMY 


5789 


19690 


A 


5835 


278 


404 


SVL I KFEKKTLKYKAI WPGAVAKACNPS 
TLGGRSRGITRSGE 


5790 


19691 


A 


5836 


417 


3 


S KGKEGRCEWQ YDLPS GTHAVFQFFLKQ 
QGRVPGLTQAVQVAKMSLIDLAGSERAS 
STHAKGERLREGANINRSIaLALINVLNA 
LADAKGRKTHVP YRD S KL TRLLKDS LGG 
NCRTVM I AAI S P S SMTNEDT YSTHV 


5791 


19692 


A 


5837 


409 


230 


AGRAYCYNGMCLTYQEQCQQLWGPGKAL 
LGSTALPSPPIiPSQPPIiAIjRWWNLGFK 
WHL 


5792 


19693 


A 


5838 


3 


409 


LRS VP CKDYLTQWHYI TS PLS EEEAAFP 
IiAYVMVIHKDFDTFERLFRAIYMPQNVY 
CVHVDEKAPAE YKES VRQLLS CFQNAF I 
ASKTESWYAGISRLQADLNCLKDLVAS 
EVPWKYVINTCGQDFPLTTTRPV 


5793 


19694 


A 


5839 


385 


174 


GliAVEIGSRRIAEDGLELLASSDPPTSA 
SQSAGSTGVSHHAWPDTVSSVYWCFLCC 
T KQVWG ALi S IV S L 


5794 


19695 


A 


5840 


260 


2 


PLPR YTAACSRLL VQ YKAAFRQ VQGSE I 
SSIDEFCRKFRLDCPLAMERIKEDRPIT 
I KDD KGNLNRC IADWS FY I TVMDKLRQ 
CI 


5795 


19696 


A 


5842 


299 


3 


FSGIKYI C I IFLFVCFFL YLLGIGS P YV 
AQAGLTLLGSSDPPPSASQSPGIIGVSH 
YVQP IHT VAQLS PSSI FRTFSS FQTET V 
PIKHTSHSPSPSLY 


5796 


19697 


A 


5843 


391 


2 


APHAP AFLRARGEPQDP LSHPRVP AVS A 
NCRMWKHLPVHSSPTPRLTPLWKLQARW 
LLPQIjVYLQGWGSYSIiLRPAAL I SMVLL. 
AREFIiYPAKMSVSEVCSSGLSSPLLEQH 
KOTL I FYASGD I CSANV 


5797 


19698 


A 


5844 


479 


59 


F VGME PG VGHASLAMHGLH I RSLGK I KLi 
RKRKCTLF FNTQE KS ARRRGHLLGEN I Y 
LLL FA I ALR I LNCLL VQTS F VPD E YWQS 
IiE VSHHMVFNYG YLP WE WTERLRS YAYP 
L I CAS I YKVLHLLGKDS VQMLVSGNKS S 


5798 


19699 


A 


5845 


406 


548 


SVQTKPFFETESHPVTQAGVQWHNLCSL 
QPL P CRS KQFL CLtRLP S SW 


5799 


19700 


A 


5846 


2 


394 


TL CHRAKD P T VHHE S GR I M YLNGLC FLM 
GPAQIiTQRLSVSRQGEC 


5800 


19701 


A 


5847 


472 


201 


LSILSDTSLCLGRFFTYENGCAYFHEEE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 
beginning 
nucleotide 
location 
co rres pond 
ing to first 
amino acid 

peptide 

oCLjUCIlLC 


Predict- 
ed end 
nucle- 
otide 
]ocsttion 
correspon 
ding to 
last amino 
acid 

I CSIU UC Ul 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny!aJanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K = Lysine, L— Leucine, M^IVIethtonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
± — inreuiiinct v — vanne, 
W-Tryptophan, Y=Tyrosine, 

V^:T In Irn Awn ^' — r% iririnn /snncrihlo 
Si. — t_>llrvlHJVrII 7 — olUJJ CUUUfl, /— (JusSILrle 

nucleotide deletion, \=possible 
nucleotide insertion 














NKKP V I YRS AP AR P GLGQ S L CNQVMW YQ 
VAILNK 


5801 


19702 


A 


5848 


127 


3 


S SNVKAAWGKVGAHAGEYGAEALERMFL 
SFPTTKTYFPRCI 












i 


LF S P E I SLDVDADRDG WE KN1STPKKV P S 
FQGRHHPEPRASEDTAPDPAGMY 


5803 


19704 


A 


5850 


423 


2 


LS P LGNWS S ANQKWFDS VKP SRKDLFA 
VDTQVGTVPSLTAGEQGFGDGGKGRAGE 

PQPLLTGPQGPCLiCRPLPNQQTSFPALL 
PFSTQLCSMCPGPASARPPPLIjLKPTMY 


5804 


19705 


A 


5851 


326 


682 


T) O "V O T T O T T r\tr"f TJOTTT'VT AADT/TT CMV 

rb x bijijbijJbUV J^JiVJiKljFKyyKl\]ifaMK 
QKMEEHTQKKQLLVSPWPQLGSAGAGAR 
PGMVTRPPLCLCPI PAPQI PRS EYLNIiA 
SSFMAPLSPPPCSLHRLSCLHLEVRNSI 
SGYPNCI 


5oV5 


19706 


A 


5852 


263 


30 


Hit-ill DJJJiKGPCjybAKfaGAJ. IaPPGPPJjP 
IEPHETTPEHPAPSGTIPEPPLPVELHE 
TTPQHPVPSGTIPXPPYLLSQ 


5806 


19707 


A 


5853 


389 


2 


GLPTQREKFGASMRTRMTIQS iawtsf 
VCLVGEGNNVQGFRAESRCWRYDPRHNR 

RG YHND LNAVE R YD PATNS WAYVAP L KR 
RCMPTKAKRWRGRCISP 


5807 


19708 


A 


5854 


379 


2 


GRSLRYSGSCSGEENSTNNSAGQSRAVI 
AAAARRRGNSHNEYYYEEAEHERRVRKR 

VMDPREAAQAI FASMARAMQKYLRAAKQ 
QNYNTMESILQHC 


5808 


19709 


A 


5855 


374 


2 


SDAGAPVNIYEFRHRPQCLEDTKPAFVK 
ADHADEVRFVFGGAFLKGD I VMFEGATE 
EEKKLSRKMMKYWATFARAGNPNGNDLS 

T Ta7"D 7\ "VNTT TTTOVT <~\T nTTJMCT r*ADT EfT? T}"D 

VEIWItfSTIPPCX 


5809 


19710 


A 


5856 


516 


18 


P YE CKE CGKAFNCGS SLVQHER I HTGE K 
PYECKECGKAFSRGCHRTQHQKIHRGET 
P I KCKE CGKAFS WGS SL VKHER VHTNE K 

LYQRKEFGKTFTHGS KLVHERTHSNDKP 
YKNKE CGE AFL WTT YSNE KC IASAK 


5810 


19711 


A 


5857 


392 


3 


CERGMGGSPRALGRHWTSFLKLRLNCSV 
PGDS TFYFDVLQALTGPVNLHGRSALFG 

KFKEQRSLDGAWTPVSEDRVPSPRPGSC 
AGVGGAAKFTSSRDLLY 


5811 


19712 


A 


5858 


391 


3 


ARRTTGMVPKAGGGKGRRGAVFRS Y IRE 
IEELRSKLVESEAMNESLRRSLSRASDR 

t> ir x o J_jl_ri-i.C3 ir^u\i^J^.r v.r'oro fJr\C3 o L v i Ji iJiAib ii V 

IRRAKQDLERLKKKEVRQRRKS PEKEAF 
KKRAKLQQENSEETNGE 


5812 


19713 


A 


5859 


409 


3 


SLPGEGDPWWAGQEELLFVQEGKIiSLPV 
AERVLLR I ACR YDPRSNS WAE IAPMKJSTC 
REHF VLGAME E YL YAVGGRNE LRQVL P T 
VERYCPKKNKWTFVQSFDRSLSCHAGYV 
ADGLLW I SGGVTNTAQ YQNRQC I 


5813 


19714 


A 


5860 


405 


135 


NLSRLSLCRQPMTLVTWVLLFYDSEEKA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K— Lysine, L— Leucine, M=Methionine, 
N=~Asparagine, P=Proline, 
Q— Glu tain in e, R=Arginine, S=Serine, 
i—i nreonine, V=VaJine, 
W=Tryptophan, Y-Tyrosine, 
X=Unknown, ^ r =Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FWLLVAIj W KRML PD YYNTRVVGE CPRVL 
PRLGLQVPAGSMRVAALPRPSSRLHVGS 
LiGVRG 


5814 


19715 


A 


5861 


389 


2 


YYSELEEQLTDEFNAELNRVPLKRLDLI 
F VT FQD S RMAKRVRKD YKY VQ CGVQP QQ 
SSVTTIVKSYYWRVTMAPHPKDI IWKHL 
SWRFFWWARFIAINTFLFFLFFFLTTP 
AIIMNTIDMYNVTRPCI 


5815 


19716 


A 


5862 


525 


0 


SHLPESERIHHTVGQKEQVLMDTSKTRP 
NNDVPEPPMPIADQVSNDDRPEGSVEDE 
EKKESSLPKSFKRKISWSTKGVPAGNS 
DTEGGQPGRKRRWGASTATTQKKPS I S I 
TTESLKSLIPDIKPLAGQEAWDLHADD 
SRISEDETESKGDDGTHDKGLTIC 


5816 


19717 


A 


5863 


399 


2 


ATRRNRNRVPSGMTRTNVREMIAAVG PG 
PSPYPLPPPPEGTSSIEYSNQGNTCQGH 
GNFDFPHGNPGGTSMNDFMHGPPQLSHP 
PDMPNNMAALEKP I SHPMQE TMPHAGSS 
DQPHPSMQQDKARNTPQPMY 


5817 


19718 


A 


5864 


3 


714 


RRP F I AL CL SNVAFML P WQ FAQ F I IjFTQ 
IASLFPMYWGYIEPSKFQKIIYMNMIS 
VTLSFILMFGNSMYLSSYYSSSLLMTWA 
1 1 LKRNE I QKLGVS KLNCWL I QGSAWWC 
GTIILKFLTSKILGVSDHICLSDLIAAG 
IIiRYTDFDTLKYTGSPEFDFMEKATLLI 
YTKTLLLPWMVITCFIFKKTVGDISRV 
LAT3WYLRKQLLEHS ELAFHTLQIiLAFT 
ALAILILRLKLVL 


5818 


19719 


A 


5865 


423 


1 


AP P VS TAVAQ SNS SEE E ARE VG S PAQE F 
KYQKSLPPRFQRQQQQQQQEQLYKMQHW 
QPVYPPPSHPQRTFYPHHPQMLGFDPRW 
MMMPSYMDPRITPTRTPVDFYPSALHPS 
GIiMKPMMPQESLNGTGCRSEDQNCVPPC 
I 


5819 


19720 


A 


5866 


497 


2 


AVGAGQKGGRGGGGRELMPPQKIWGGGA 
FTNNAHVGPLKIMPRLIRTGMLEEIHLE 
PGWQGLFPCVDELSDIHTRFLSQLLER 
RRQALGPGSTRNFVIHRLGDLLISQFSG 
HSAEQMCKTYSEFCSRHSKALKLYKELY 
ARDKRFQQF IRKVYCGR CRGSQQGR 


5820 


19721 


A 


5867 


382 


74 


LALSPRLKCSGQI IAHCSFDLLGSSDPP 
PSASR V AG Tu. GAkRHARL P Y S LRHYHFV 
LRLFNPSKHSKLKHFHPILIRNPGKVGS 
YS FQMSDLLQSRRARQQS 


5821 


19722 


A 


5869 


378 


3 


SFSRSANLISHQRIHTGEKPFQCAECGK 
SFSRSPNIiIAHQRSHTGEKPYSCPECGK 
S FGNRS S LNTHQG IRS GE KP YE CKE CGE 
S F S YNSNL I RHQR I IiTGE KP YKCTD CGQ 
RFSQSSALITPV 


5822 


19723 


A 


5870 


375 


3 


IHVIVFWQATGHVMAKRVFDTYSPHEDE 

-rtlYlv LiE IjINIu V AJroK.Vi-iX<- 1 v ivLJ iivjror xiLixv 

DTAKALLRS LGSQAGPALGWRDTWAFVG 
RKGEAECHWADTELNRRRRRFCSKVEGY 
GSVCSCKDPTV 


5823 


19724 


A 


5S71 


373 


1 


QPBEVSGAIiS PPSAS AYVKLVLLGL IMC 
VS LAGNAI LS LMVLKERALHKAP YYFLL 
DLCLADGIRSAVCFPFVLASVRHGSSWT 
F S AIjS c ki vafmavl f c FHAAFML F CIS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
□9/515,1 
Zb 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














VTRNMAIAHHLY 


5824 


19725 


A 


5872 


382 


2 


KSFSQKSQbJIHLRTHTGERPFSCPECG 
KAFRE KS T VI IHYRTHTGE KP YE CNE CG 
KAFTQKSNLi 1 VRQKTHTGEKT YE CAKCG 
ESFIQKLDLIIHHSTHTGKKPHECNECK 
KTFSDKSTLIIPHV 


5825 


19726 


A 


5873 


370 


3 


MGRVRAQNISGVMSGPQKMLMQSQFPTQ 
GQQGFCEGKEPYQAMSQNMGNTQDMFSP 
DQSSMTMSNVGATRLiSHMPLiPPASNPPG 
TVHSAPNRGLGRRRSDLTISINQMGSPG 
IGHLNSTTCI 


5S26 


19727 


A 


5874 


362 


2 


GGKFLVLGNLPSKIiEESMVQYYRljV'TAA 
SLiVRGQISEYNISLRASDGGSPPLSTET 
HITLHVIGINDNPPTFPHLSYSACIPEN 
NPRGAS I FS VTAQDPDSJNJ>JJNAR I TYALT 
EDTLQGVY 


5827 


19728 


A 


5875 


369 


1 


R I RPRPTARLASARTLHEVS LQES IR YA 
PGDAVEKWLNDLLCLDCLNIARIVSVCP 
LPEACDLSYVNRDTLFWCHKASEVFLQR 
LMAL YVAS RFKNS PNDLQMLSDAPAHRL 


5828 


19729 


A 


5876 


119 


39 


VIQDYTTPPNEELSRDLVNKLKPYMR 


5829 


19730 


A 


5877 


461 


2 


RRGWT S RRKP KE DPS GAAVP EMP KKS S K 
IAS F I P KGGKLNS AKKE PMAP S HS G I P K 
PGMKSMPGKSPSAPAPSKEGERSRSGKL 
SSGLPQQKPQLDGRHSSSSSSIiASSEGK 
GPGGTTIjNHSIbSQTVSGS VGTTQTTGS 
NANSVQLPQPLCI 


5830 


19731 


A 


5878 


503 


3 


PSPQVPGECPSPKKLGARAAFTTPDPAP 
LSPQSRVASSGSEQTEEQGSSRNSFQED 
GSGMKDVPSWLKSLRLHKYAALFSQMSY 
EEWTLTEQHLESQjSrvTKGARHKIALS I 
QKIiRERQSVLKSLEKDVLEGGNLRNALQ 
ELQQI 1 1 TP I KAYS VLQAT VAAATLY 


5831 


19732 


A 


5879 


421 


34 


LVHKVL S ASNVLVDAEGT VK I TD YS I S K 
RLADI CKEDVFEQTRVRFSVNAMHYRTG 
NKGDVWRLGLLMLS LS QGQE CAE YPVAI 
P S GL PGGFQDFLKKC VCLDDKKS MES P A 
WETKQNKS PANVLRPL 


5832 


19733 


A 


5880 


379 


3 


AQGLLLAWPNLNRQHCPCSCSHNLSTFI 
QNWKSPGIiSLKTRAPGQGSVLSSSDLPR 
AVLTGT FAVMS VMVGS VTE S LAP Q ALNG 

f* ft /I "TTvTTTirn"5V TITA'A 7\ TIT 7VVTT7V 01X11* fit TT T T/~1 T *P~1 /~\f r 

S M INE TARDAARVQ VAS TLS VL VGLFQ V 
WNSQEYVPPTTCI 


5833 


19734 


A 


5881 


429 


1 


S LiVS YMDTE S TAE KLGKE LGLQAKELS A 
VHSSHHEIGVNDSNLFSLEMREPLESSN 
TKSFHSAVEIKNRSQHELPCFQHHGIDT 
PTSLQKRSACCPPSLLRLEETASSSQDG 

T\T">/"\-rT»TT'\T\T nnfTHOT "hTTT'OT IT1TTT TT» f*» TP X "A T rrn 

QCI 


0o54 


19735 


in. 


COO') 


456 


5 


AVHSSHHEIGVNDSNLFSLEMREPLESS 
NAKS FHS AVE I KNRS QHELP CFQHHG I D 
TPTSLQKRSACCP PS LLRLEETAS SSQD 
GDPQMWDDLPFSESLIKFKTVLESSIAV 
TQCI 


5835 


19736 


A 


5883 


195 


353 


DS YS YVR S TAPAVAYDS KQ Y YQQP TATA 
AAVAAAAQ PQ P S VAE T Y YQTGG F S 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, P= Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Unknown, *=Stop codon, /=possibfe 
nucleotide deletion, \=possible 
nucleotide insertion 


5836 


19737 


A 


5884 


432 


1 


LSAKVPVASVSDQAAAMHLSQCPKNLAT 
SIAELRTASQKAHKLCGPMEIDSALNTV 
QTLKNELQDAKMAAVES QLKPLPGETLE 
KCAQDLGSTSKAVGSSMAQLLTCAAQGN 
EH YTG VAARE TAQALKTLAQ AARGVAAS 
TTDP 


5837 


19738 


A 


5885 


313 


I 


GKSPHLLILQTSPLSKAPQPLILLLLYW 
SWGWSVTAASRETEAGIQVSSEEPGP 
RNVS PHSRLQDETNLRLEAENNIAAYRQ 
VREVQGRDREGRKGLMATLY 


5838 


19739 


A 


5886 


432 


3 


GSGFCSPIL CLW WVEE AVG 1 1 RLHLWLR 
ELGCWHSFSLPGTYSLEFLSTCTQVLPY 
SWSGLLFASGNYLSLYLVPSRAETDSRG 
SQPLPVARHGVGACKCKPRQTRPGLS PQ 
VCVEKLMPLS S FCS AFQQNTYNKQPMSR 
PV 


5839 


19740 


A 


5887 


417 


2 


AS LM VAP DYAE ISP LAM PAP S S G WC T P 
IMSTSSSEAMSTPLMLAPDSGELSPILM 
QDMNPGVMSTQPVPAPSFEAMSPLQITD 
EDTEAMS KVLMTAIiASGE I SSLLMSGTD 
SEAI S SLIMS AVASGGTSPQPTSTLY 


5840 


19741 


A 


5888 


420 


3 


KFDL I KKLDLKTM S S YDLD I E ASDGGGL 
SGKCSVSVKVLDVNDNFPELS ISSLTSP 
I P ENS PETE VALFRI RDRDS GENGKM I C 
S IQDDVLFKLKPSVENF YRLVTEGALDR 
ETRAEYNITITITDMGTPRLKTEQSV 


5841 


19742 


A 


5889 


321 


414 


FNMR I WPGAGAHACNP S TLGGRGGR I TR 
SAD 


5842 


19743 


A 


5890 


415 


1 


PGLPGTS VKGI PAS KQS PHE S PRTLHLK 
TSPI IQQLGLYLSHTAIRYHPQETLKEF 
VQL VC PDAGQQAGQ VGFLNPISTGS S QDKV 
HNP FLP TPMLP P P P P PPMARP VPLPVPD 
TKPPTTSTEGGADYPTSPTYSTPSLY 


5843 


19744 


A 


5891 


193 


3 


KPSSKVWSRRDPFLLSSVEQPIKDAVIT 
VPVFFNQAERRAVIiQAARMAGLKEMQL I 
NDNTACI 


5844 


19745 


A 


5892 


412 


2 


KHQVSDNKDSFYVSLYPDFSSLSRAILD 
WQFFKWKTVTWYDDSAGL IRLQEL I K 
AP S R YNLjRLK I RQ L> PVD T KDAKP LL RE M 
KRGKE FHVI FDCSHEMAAG I LKQALAMG 
MMTEYYHYIFTTLDLFALDVEPSV 


5845 


19746 


A 


5893 


413 


3 


ELLLCSNTSCCRCFCVECLEVLVGTGTA 
AE AKLQE P WS C YMCLPQRCHGVLRRRKD 
WNVRLQAFFTSDTGIiEYEAPKLYPAI PA 
ARRRP IRVLSLFDG IATG YLVLKELG I K 
VGKYVASEVCEES IAVGTVKHEWR 


5846 


19747 


A 


5894 


424 


3 


CSGRREPSVRGVQCKGDQGHHSARMAPS 
EAPGTRS CTPSHGQNTAAEATPAQKTPA 
KWyVFSTEMANKAAKAVLKGQVETMVS 
FH I QH I S S S KTERS TAPLNTQ I S AIiRND 
PKPLPQQPPAPASQDQNYSQNTRLQPCI 


5847 


19748 


A 


5895 


400 


1 


ASVQNPALRLVTREEFAIMQTPAGELYD 
KSIIQSAQQDS IKKANMKRENKAYSFKE 
QI IELELKEVSTRRHLRKVAAELGGLKL 
VAFPTAHMIiFTWNILRPMAHSSDAPDQC 
LVHWHTFPSPLLHS FFHPQLY 


5848 


19749 


A 


5896 


421 


3 


VSS IQMGAVNLLGGGLDSLLGSDLGGGI 
AGS PAVGQSF I P S S VQATCAPS P TPAW 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
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S SGLWDIiS ELS TW IGMAHGG YVAPKAVW 
LPAVKAKGLESSGTFTRRQGHIYMAMDF 
TNKALQHMTDS AI QFTKNS FGVIPS C I 


5849 


19750 


A 


5897 


384 


1 


PRASRFIHEAGIVPLPMRTHTPKKPFTC 
EVAGVQFSRNVKVKIHMRKHTGERPYSR 
QSCSVRFVHSYDLKNHMHLHTGDRPYEC 
HLCHKAFAKEDHLQRHLKGQ I CLEVRTR 
RRRKDDAPPHYPPPCI 


5850 


19751 


A 


5898 


429 


2 


QDIAAWQSLTQVLTPESWRKANIMTEPQ 
KSQERYKGIYVKEKLYRRARHDESLNWT 
SCDHHESQECKGEDPGRHPNCGKNLGMK 
S TVEQHH WHVLPQPFTCWNCG VAFADD 
TDPRAHPSTHLGEKSYKCDQYGKILSQS 
LY 


5851 


19752 


A 


5899 


424 


1 


GTMLQALAAHDAGSRAHVLLSLSQQDGI 
EQHMDFDSRYTLLELFAETTSSEEHCMA 
FEGIHLPQIPGKLLFSLVKRYLCVTSLL 
DQLNSSPELGAGDQSSPCATREKSRGQR 
EIiEFSMAVGNLISELVRSMGWARNLSEH 
V 


5852 


19753 


A 


5900 


135 


459 


YFMLKAEHVI S YVCS ENS LMVWNMRFC 
LIGLIHIQEFYFFEMEPHRAIQAGVQWC 
DLCSLQPPPPGLKRCSCIPPSSSRDYRC 
APPYP 


5853 


19754 


A 


5901 


418 


10 


GRSPKPGDRLWESVHLSFPGPADSLSGG 
SAPASSYEPSQRSSFSSNRSQRGSTSTR 
NSSQKGSSVLSIKQKGKRELYMEKLQEH 
L I KAKAFT I KKKF S QNLVAQL W YFVKCV 
Y FGL S AYQ I PRG Y P TRVLGNFLT K 


5854 


19755 


A 


5902 


402 


2 


GYRHPELEIKSVDGFQGPEKQPVIEjSLV 
RSNRKGEVGFPVEDRRINVDVTRARRLV 
TVI CDYRT VSSHAFLKPLVE YFTQHGEV 
RTAFEYLDDIVPDNYSHEDSQGSSQAAT 
RPQGPGTSTRTKKQRQEGGMY 


5855 


19756 


A 


5903 


346 


1 


GICPLSHIKMTDARCQIYEMKGGEKSPK 
D TGKE P GHS E AKTGP P QVLAG VP AQPE A 
PQPGPNTTAAPVDSGPKAGLAPETTETP 
AGAS ETAQATDLS LS PGGES KANCS PED 
PMY 


5856 


19757 


A 


5904 


139 


212 


EVEHNDD I SHHHHHHHHHH SNS KS 


5857 


19758 


A 


5905 


2 


345 


PCGCWGRCALLL I SAAAKAKS KCGPTFL 
P CAS G I H C 1 1 GR FRS KGFED CPDGS DE E 
NCTANPMLCSTARYHCKNGLC I DKS F I C 
DGQNNCQDNS DEE S CE S SQGVGY VCGFK 
KA 


5858 


19759 


A 


5906 


432 


17 


PQTTPHRTFGGGKAAWLLAVGGQFLLC 
WSPYFSFHLYVALSAQPISTGQVESWT 
WIGYFCFTSNPFFYGCLNRQIRGELSKQ 
FVCFFKPAPEEELRLPSREGS IEENFLQ 
FLQGTGCPYKSWVSRPLPCIAAALRV 


5859 


19760 


A 


5907 


407 


1 


PYACGECGKS YRQSS SL.VSHRRIHSGVR 
PHHCDECGEFFSRKYDLLIHQRVHSGER 
P YKCS E CGKS FSHS CS L I AHQR IHTGMR 
PYECSECGISFIHSCSLITQQRVHLGTR 
PYMCSECGKSFSQSCRLIKHRSV 


5860 


19761 


A 


5908 


419 


3 


VLAKKTI I TKS ARD CHE FGNILHLS TNL 
VASIQRPDKHESFGWNMVDNLDLFSRSS 
AENKYDNGCAKLFFHTEYEKTNPGMKPY 
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E=Glutamic Acid, F=Phenylalanine, 
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GYKECGKGLRRKKGLSLHQRIKNGEKPF 
ECTACRKTFSKKSHLIVHWRTHTGVY 


5861 


19762 


A 


5909 


441 


57 


KVMGIFKSSLFQALVDIQEFYEVTLINS 
QKSCEQKIEEANQVLQKWEKTSLLAPCH 
DRLQKSSELQSLIGQHSSLETTDGPQPW 
L FPRG S EGHRG KT YDHCHS IGLQIAVDQ 
RKMLHVLS KI KKIRV 


5862 


19763 


A 


5910 


3 


317 


CLLMLWRRWGSRGSRQRRLQATPSWEST 
KTADGQGDRRESGS QRHCS S PHPQSKS T 
TT S S VPAAREG AQ E AAQGG PRKG Q TS RP 
Q VSTRQ CAM P P ARHRKLENDA 


5863 


19764 


A 


5911 


183 


1 


QNSD S KDS LNRVASRDHAKPNLTCHVS P 
AIQGTGS ISESSIPSVSDTSTPRRSRRQ 
LPPCI 


5864 


19765 


A 


5912 


395 


1 


KT IQ FN I C VKVFHR FS NSNKD K I RYAGD 
KTFKCKECGKSFHVLSRLTQHKRIHTGE 
NPYTCEECGKAFNWSSILTKHKRIHARE 
KF YKCE E CGKGFTRS SHLTKHKR I HTGE 
KPYI CE KCGKAFNQS S TPV 


5865 


19766 


A 


5913 


449 


2 


FGSHLEKEDEKKQELVDKAI KPS I EATL 
ESIQRKLQYKRAESSRPEDIKDMTKAQI 
ANEKVALQKALLYYESIHGRPVTKNERQ 
VMKPLYDRYRLVKQILSRANTIPIIEEE 
EGSEADSNVKPDFMVNKKTDFSARMVKR 
PLGGSARPD 


5866 


19767 


A 


5914 


464 


23 


SAHMTETRSKSFDYGSLSLTGPSAPAPV 
APPARVAPPERRKCFLVRQASLSRPPET 
ELEVAPKGRQESEEPQPSSSKPSAKSSL 
SQISSAATSHGGPPGGKGPGQDRPPLGP 
TVPYTEALPVFHHPVAQTHKHEKPYLPP 
LYCGRCG 


5867 


19768 


A 


5915 


366 


1 


LVSFINFFTSVIATLWFAVLGFKANIM 
NEKCWENAEKILGYLNTNVLSRDLIPP 
HVNFSHLTTKDYMEMYNVIMTVKEDQFS 
ALGLDPCLLEDELDKSVQGTGLAFIAFH 
EAMTHS PAC I 


5868 


19769 


A 


5916 


246 


366 


TS VKEQQMPGAVAHACNPS TLEGQGGR I 
TRSGVRDQPCQH 


5869 


19770 


A 


5917 


400 


3 


NDNAPE F YQ S VYKVTVLENAFNGTLVI K 
LNATD PDD GTNGD I VYS FRR P VW P AWY 
AFTINPNNGEIRTKGKLDFEEKKLYEIS 
VE AVDKGNI PMAGHCTLLVEVLDVNDNA 
PEVTITSLSIiPIREDTQPCI 


5870 


19771 


A 


5918 


410 


200 


CT PPQ P VKCRFLNRDRVS S CCPGWS Q I P 
GIKRSSYLSLPECWDLRPVILLFCFLSH 
ISSKQPYFLPPSYR 


5871 


19772 


A 


5919 


214 


407 


MYFYIDRASLLSPRLECIWTIAHCSLE 
LLCSRDLP AS AS QS AG I TG I RHHTWLKT 
HFYSSFKT 


5872 


19773 


A 


5920 


476 


39 


VP YGL F VGS TGRLGL P YLRGT S H P LRG C 
IjHAAATjWGRRLVQPXiTRWKHE G CAEE FS 
ANDDVALGFCGSHSLAALPAWGTQDEGT 
LEFTLTTQSWQAPIiAFQAAGWHGDFIHV 
D I FEGHL WS MVEKGQGTVLLLNS VP VTD 
AQPHM 


5873 


19774 


A 


5921 


430 


2 


SLRPSSTGPSPSGGLSEEPAAKDLDNRM 
PGLVGQE VGSGEGPRTS S PLFHKAVFLR 
P S S STM ILFETTKSG P ALGKAVS EGAEE 
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AKLGVSGSRPEVAAKPALPTQKPAGTLP 
RSAPLSQDTKPPVPQEEAGQDHPPSKAS 
RC 


5874 


19775 


A 


5923 


411 


2 


MSLKVD VEALENS AGAT YI RKKGGKVTG 
DSQPKEQGQGVLKKKKKKKKGMVPKNYD 
PKVTPDPERWLPMRERSYYRGRKKGKKK 
DQ I GKGTQGATAG AS S E LDAS KTV S S P P 
TSPRPGSAATVSASTSNIIPPRLY 


5875 


19776 


A 


5924 


402 


2 


LiQSGAIMNKFYQ PHEAHI P YLLQLF I D Y 
NLYGMNL INIAAVKFRKARRKSNTLHAT 
GSCKNHLSGNSLADTLFRWEQDEIPSSL 
I LEGVE PQST CELE VDAVAAD I LNRLD I 
EAQIGGNPGLQAIWEDEKQRR 


5876 


19777 


A 


5925 


215 


403 


AETSTYFFFFLERQFPFAPQLGGHGRNL 
ttSLNLLPRGLREFSRLSLPGGWIYGGAP 
PSPTFFC 


5877 


19778 


A 


5926 


320 


1 


PGRRDRWIWGQGECLEYLLASMTQWCER 
EISS IAPGELCCLLLSFLPQEECQNYVR 
VLIVAGRKVFMCGTNAFSPMCTSRQVGN 
LSRTTEKINGVARCPYDPRHRV 


5878 


19779 


A 


5927 


400 


1 


WATDGGLPLLAS S ATVS VALQDVNDNE 
PQFQRTFYNASLPEGTQPGTCFLQVTAT 
DADSGPFGLLSYSLGAGLGSSGSPPFRI 
DAHS GD VCTTRTLDRDQG P S S FDFTVTA 
VDGGGLKSMVYVKVFLSDEMY 


5879 


19780 


A 


5928 


398 


62 


HNTEVLVGESVTLECSATGHPPPRISWT 
RGDRTPLPVDPRVNITPSGGLYIQNWQ 
GDSGEYACSATNN I DSVHATAF 1 1 VQGG 
YPSSVPEPGLSPILTPPLARAAIPTDAP 


5880 


19781 


A 


5929 


403 


1 


LASDGGDPVLSGTSRICVKVLDANDNAP 
VFTQPEYRISIPENTLVGTRILTVTATD 
ADEGYYAQWYFLEKSPGETSEVFELKS 
TSGELT I I KDLDYEDATFHE I D I EAQDG 
PGLLTRAKVIVTVLDWDNANV 


5881 


19782 


A 


5930 


408 


3 


VASDGGDPVLSGTSRICVKVLDANDNAP 
VFTQPEYRIS IPENTLVGSRILTVTATD 
ADEGYYAQWYFLEKSPGETSEVFELKS 
S SGELT 1 1 KDLDYEDATFHE I D I EAQDG 
P GL L TRAKVI VT VLD VNDNANV 


5882 


19783 


A 


5931 


126 


391 


PVLTCAHFHFSAGAGGSYFMISRSLGPE 
FGGAEGL CL YLATT FATAM Y I LGA I Q I L 
LVSQAEPWSTLAMIKQFLWTPNRRLMVL 
I PTC 


5883 


19784 


A 


5932 


436 


359 


NKEWLPGSKVPEKS INDVKNTSGLL 


5884 


19785 


A 


5933 


365 


3 


LIIGMLTAIIGDLASHFGCTIGLKDSVT 
AWFVAFGTSVPDTFASKAAALQDVYAD 
ASIGNVTGSNAVNVFLGIGLAWSVAAIY 
WALQGQE FHVS AGTLAFS VTLFT I FAF V 
CISVLLYR 


5885 


19786 


A 


5934 


394 


3 


F Q Y F S RTDRVL KHE RMCHENHD KKLNRC 
AIKGGLLTSEEDSVFSTSPKDNSLPKKK 
RQ ITEKRS S GMDKE SALGKCDLKKVKND 
YS PLYS S S TKVKDE YMVAE YAVEMPHS S 
VGGSHLE YASGE IHPPCI 


5886 


19787 


A 


5935 


393 


1 


REDLIAGIDEFLDEVIVLPPGEWDPNIR 
IEPPK5CVPSADKRKSVFSLAELGQMNGS 
VGGGGGAPGGGNGGGGGGGSGGGAGSGG 
AGGTSSGDDGEMPAMHEIGEELIWTGRF 
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FGGLCLDIKRKLPWFPSCI 


5887 


19788 


A 


5936 


410 


126 


PI CYYKKAGRARWLTPVIPSSWDYRHQA 
QL I FVF LVE TG FHRVAQAG P E LMTS GD P 
LAYNFLCSYPNEVFRSQASASDLQFVLR 
NLEPPDAG]W 


5888 


19789 


A 


5937 


426 


1 


DATFHEIDIEAQDGPGLLTRAKVIVTVL 
DVNDNAPEFYMTSATSSVSEDSLPGTI I 
GL FNVHDRD S GQNAFT T C S L P EDL P FKL 
EKSVDNYYRLVTTRALDREQFSFYNITL 
TAKDGGNPSLSTDAHIIiLQVADINDNAP 
CI 


5889 


19790 


A 


5938 


438 


3 


AD P TKGL LRNG S VC VRAPGG VS QGNS VN 
LKNSKQACLPSGGIPSLNNGTFSPPKQW 
SKESKAEQAESKRVPLPEGCPSGAASDL 
Q S KHL P KTAKP AS Q EHVRC S AI GTGE S P 
KE S AQIiS GAS P KES PSRGPAP PQENKVV 
SPW 


5890 


19791 


A 


5939 


3 


191 


YS VCLWSQLLRRLRQEDHLS PGGGGCTE 
P WNTERLS KNKKTKKRERE KQ CG FHS M 
WYCNLK 


5891 


19792 


A 


5940 


399 


1 


QYSDDNDMSWKVTRAAAKCLDAWSTRH 
EMLPE FYKTVS PAL I YRFKEREENVKAD 
VCHAHHSLFKQTSPVQSWLCGLDAMDQG 
GTPLAMLQSQVANIVKAVHKQMKEKSVK 
TRQCCFTLLTELVNVLLGAVY 


5892 


19793 


A 


5941 


411 


3 


SPCEGPRRFQCKSGKRVDGGKVCDVQRD 
CRDWSDELLKVWCGACLRPLAGLSLLPS 
PSWYLGSRPSSAPCPDTFCSGPLFGFMC 
RPMASHGAFRPQASGLHLYKVLRACPSQ 
VLKNYVFSHKLGLSSFLPRSDHV 


5893 


19794 


A 


5943 


431 


26 


KAWGIPGDMGPPGITVRPGYNGLPGNP 
GVQGQ KGE P G VGL P GLKGL PGLPGIPGS 
PEEKGSIGVPGVPGEHGAIGPPGHQGIR 
GEPGPPGLPGSVGSPGVPGIGPPGARGP 
PVGQGPPGLSGPLVIKGEVSRVR 


5894 


19795 


A 


5944 


396 


3 


PLPVELIRVPAFLDLFMQSLFKPGARIS 
QDLKHKYIHIIiAYAASWETCKKNKRVS 
INKDELKSTSKAVETVHNLCCNEMKGAS 
ELVAE L S TL YQ C I RF P WAMGVLKW VDW 
TVSEPRYFQLQTDHTPTV 


5895 


19796 


A 


5945 


465 


32 


ERVTLAD I T WCTLLWLYKQ VLE PS FRQ 
AFPNTNRWFLTC INQPQFRAVLGE VKLC 
EKMAQFDAKKFAETQPKKDTPRKEKGSR 
EEKQKPQAERKEEKKAAAPAPEEEMDEC 
E Q AMAAE P KAKD P FAHMN KS T F VLDD C I 
AAAIi 


5896 


19797 


A 


5946 


414 


2 


ATPPVRCIRKEIRNWYVDIQPVQEPKAQ 
ACGNGHG III IAETS TGCLFAGS SLGKR 
GVNADKVAIEAAEMLLANIiRHGGTVYEY 
KQDQLIVFMALANGVSIIKTGPVTLHTQ 
TAILYDEQIVKVNCKENHSVDEQVY 


5897 


19798 


A 


5947 


411 


1 


E PCWRR IADS S VQTDDEDGE SRYLLS R 
RRRARRSADCSVQTDDEDSAEWEQPVRR 
RRSRLPRHSDSGSDS KHDATAS S S S AAA 
TVRAMS S VGIQT I SDCSVQTEPDQLPRV 
SPAIHITAATDPKVEIVRYISAPCI 


5898 


19799 


A 


5948 


153 


409 


LPFFPILLPFISFFRYRVGLLSPKLKCS 
GVIIAHCSLELLGSSNPPTSASRVIFLY 
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Amino acid sequence (A-AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=GJycine, H=Histidine, I=lso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, '-Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LSLCLFVFEMESCSVAQAGEQWRDLGSL 

Q 


5899 


19800 


A 


5949 


21 


447 


RAAAMSSRNFRFLSSHRELYFGVDLPSG 
NLVVREPADREQLCRAKAACVLTYDLVL 
EDPLELHKIRIHVLDTNDNSPLFPAGDV 
QLHI PEFLTPGARFTLPNAQDDDEGSNG 
ILSYSLSPSQHFRLDMGSRVDGSEYPEL 
VL 


5900 


19801 


A 


5950 


268 


19 


QDNWAY YCTVPE YCKTKLNI LAWWLVIN 
YLSGDFYFFFIFFLDGVSLCHPGWSAVA 
RSQLTATSSSQVQGRFSCLSHCIAAAL 


5901 


19802 


A 


5951 


389 


3 


QQMDLARQQQEQIARQQQQLLQQQHKIN 
LLQQQ I QVQGHMP PLMI P I FPHDQRTLA 
AAAAAQQGFLFPPGITYKPGDNYPVQFI 
PSTMAAAAASGLSPLQLQQLYAAQLASM 
QVSPGAKMPSTPQPMY 


5902 


19803 


A 


5952 


2308 


307 


RRRPARKKTWVRDGGPHQGL/YPQLPSA 
VLQPTQPGHGPRALGKGRELESWQGKRP 
GK/GQTHRRKTRGTAS PAVYFS VEWGD \ 
SGGCPMTGKGAKQQRAGLGAAPNPELSG 
EQGTGRGRAQDSQGHGGGKGRRGQHPPG 
ND/PRGVSGMGQESVHSPEARPTGGGTG 
/EMLGRTWRHQQGLVGGTGNLLETGSWS 
GS GS WGRG WLR P QE GQ VT TGMGL AGR / 
YQPARQRAVELS PGAQGLRQRRGGWSGP 
PQETEQSGVPGG*GSWPLESHQGEQ\PA 
AGQKQSLQMLPGGFPCWS/SGMGGGPQQ 
LLESEGAGPSPGGGRHHGKGRVAVTTTP 
REGD\RGQSPGGHTLQLFPCPLWS*ETQ 
GQGSRTVQHQEWGGEGREGEGAGSDQSR 
ALGIA* ICPHTLRPTSMEPQSTAPAPGQ 
PPGPPSWGHRGHQGMGYGGRCPGQ/IQG 
RGRAQLGSTVG*RDG/QRG*MQGRDQQG 
PRSNGAGVGSCSHTSQK*IPPSSLCT*N 
SSHGPASG\QLWWSSPFIHSPGETNIPH 
TLTEPHSVPGWCWDTLRRHGAGQGHPGM 
ARSGTGEGQ/ QRGRY*ERGRRGRRQQKQ 
KKQGLKEPG*RAAPTLKGATRPLCRCLR 
KVQKPKQDGDVGS * LLKVFRAPGALGTK 
PQRTCRGPADFFHALSGLSNVHRSYCST 
NSGFGACM 


5903 


19804 


A 


5953 


461 


117 


HPRDIIGPVHQGLPPSPQPPPPCKQAPT 
TPSS \M * P * TNRHD TGLS GP VQ VP GAD W 
KPLNAP*LPELAFGEEPCRPGVPDPPYV 
WLPDSAPEHCVTWATSPGLSDLLSYPRR 
KQA 


5904 


19805 


A 


5954 


845 


610 


F FETGS Y / S VAQ VGAQWCNFGS LQSQPP 
RLKRS CHLS P PS S WD YRGAPPRLARF FF 
FCIFYRNGVSPCCPGWS*TPELQ 


5905 


19806 


A 


5955 


1 


378 


KT P VS DR \ ATKC C S E S LGNRRP C F SALE 
AYETYVPKEFNAETFTLHADICTLSDKE 
RQ INKHTALAEL VKHKPKARQEQLKAVM 
DDFADFVEKRCMADDKETCFVEEG* TLV 
AASQAALGLLHRIK • 


5906 


19807 


A 


5956 


723 


343 


G CQEG I C P S P CP V I P ATRE AE AGE S LE L 
RRRRLH * AE I VPLHS \ KAQRS L C S S D F I 
R I LV I F S GMF LVF TLAGAL FLHQRRKYR 
SNKGESPVEPAEPCRYSCPREEEGSTIP 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\y— Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














I QEDYRKPEPACS P 


5907 


19808 


A 


5957 


55 


491 


AQHFNPHNWPVT*LLPHFTNEGGQVTEA 
RCLKWQAR\CLVPMQKVPNAMPGT** PS 
AEVTPPVCNQHHWPQVEGIPSS /APAFQ 
WPLTMGHPC\WESAPRQDASTSPRPGTA 
SHCPSAVAFQERGSSPCPPHHSDSGLCL, 
LPRLLPAP 


5908 


19809 


A 


5958 


2 


413 


KKPDQ\AE\REHLCTSL\WSGRNTDKNG 
EELHGGKRVMERLKKAVKIANQCMDPSL 
QMQLYIE I LNRYI YFYEKE YDAGTIQVL 
NQLIQKIREDIiPNLESSEEPEQIT*HFH 
NTLEHLRLRRESPESEGPIYEGLILYE 


5909 


19810 


A 


5959 


871 


564 


WMCHATFGSHLN*AWRLRPVNSG/LLM/ 
P SEGKAKKE S TLM VGVTNPD \ M KGKI RL 
LLHSDGKDEYVWNAGHPDILG\HPLTLQ 
CPIIKI KKKL* * PKSGRTADGPDHSRI K 
V^VTPLGKEPTTDEVIjAEGKENME^VTE 
TC* IDMN* VLQGQTWNLQR 


5910 


19811 


A 


5960 


390 


271 


GINFFSRN/RSLTT*SRLV*NSWAQVIV 
LPWPSRVLGFQA 


5911 


19812 


A 


5961 


333 ' 


922 


GLS CRVFGRTLS C P P C ILRHLS E VETLQ 
TLQKKEIEDLYSRLGKQPPPGIGAPAAM 
IjSSRQRRLrSKGSFPTSRRNSLQRSEPPG 
PGIMRRNSLSGSSTGSQEQRASKGVTFA 
GDVGRM* IQNRSHVSPPHQGPPWSLCSQ 
NLMLSDQQN*ARKI PTLKG* KA\GGHGE 
CSSI IVKSQTYVNCIiLCGGVSSAAYHLH 
L 


5912 


19813 


A 


5962 


705 


387 


CVAQTGVQ^HDLGSLQPLPPGFKQSSHL 
SLPSSWDYRRVTPRLPNF\*FFVETDFC 
HI AQAVLQLiL\ S SNNLPAS ASQS AGVTG 
VSHRARPTLTLD I QAE I PDLKPSG 


5913 


19814 


A 


5963 


425 


182 


ASNQ * DSVGVGPSE PGAG YNLLVRFLKC 
\LEKHSSWVGVTQFSRCHLSPLSIVTRKG 
NSLTPCTSRVRQCLALLWLTHGSRTH 


5914 


19815 


A 


5964 


388 


14 


PPMYTQLCSIRSTQAFCFVFFSRDE/SL 
ALLPRIiVLNSWVQAIbLPWPPKVQG*QA 
QWL TL VI PALWEAMAGGLLEPRS SRPAW 
ATTQDL I STKENKTKSLSASDAAELCVH 
WRLLKI 


5915 


19816 


A 


5965 


1 


373 


DDQRVKSVINLLLAAYTGDVSALRRFAL 
SAMDMEQRDYDSRTALHVSAAQGHVEW 
KFL\LEACKVNPFPKDRWNNTPMDETLH 
FGHHDVFKILQEYQVQYTP*GDSDNGKE 
HHTVHKNLDGLS 


5916 


19817 


A 


5966 


3 


329 


HEETSRYEIYELIHVHPYAVKQSFDLEE 
YSLSQSTLEQVGHF*GFVSSMVYVYKTF 
PGSCRGSTAIiFSSLYFLPPFQVFLELSK 
EQELGDFE/EDFDPSVKWKLLPQEEP 


5917 


19818 


A 


5967 


2 


47 


MEDRRLVRGIPCPQHNARQCPAVPPGIQ 
AYGAA.P FEDLQVD FTEMSKCRGNKYLLV 
LGRT YSG * VE AYP TRTE KARE VTR VLLR 
DLLPRFELPLRIGSDNRPAFVADLVQKA 
AKILRI TWKLHAAYWPQSSGKVERMNQT 
IKNSLGKVCQETGIiKWIQALPMVLFKIR 
CTTSKRAGYSPYEILYHRPPPILRGDPG 
TP*ELGEIELQRQLQALGKITQTI/YSP 
S KWP VS L FS P VHPLS PGDRVWI K3STWNVA 
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C=Cysteine, D-Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
KHLysine, LHLeuctne, M=Methionine, 
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W=Tryptophan T Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














SLCPLWKGPQTWLSPPTAVKVE*IPSW 
IHHS HVKP * TA 


5918 


19819 


A 


5968 


2 


150 


KSQSI*QSSLLSYFEKLPQPPQPSA\TS 
TSKQDPPLATRLWLAEGSDDH 


5919 


19820 


A 


5969 


3 


371 


HEGKS G PMTG WLVAVGE VAM KI LLL C L 
CL I LLRVRS CRRKAARAALGMBAADAVT 
D*SPDSRLLPD\PPHPVPPQSSRPCFLC 
RLCMVCPLLDSPDPSFPCHPTSPQTLRA 
KDSAALDTLHV 


5920 


19821 


A 


5970 


396 


58 


YWVLSKGPVTCMIiLNSVFS FQASSCLS F 
L I NS * AP * PG I FLGMS VF I L P F FVF VGQ 
AGLELLTSGDLPDSASQ\ARITGVSHCA 
QPHTALSQYIiEQCrjAPSRQLrSVSLILR 


5921 


19822 


A 


5971 


442 


120 


ICMKQGLALSPRLDSSSAFSNTVQPMPP 
RFKSSYGLSLP/RSWDYRCTPPHMARSG 
NGV* P \ FMVETGS W* AAQGGLEDLSS SN 
LPP/ SAFQSAGI TS KGHCTRPAVFLVC 


5922 


19823 


A 


5972 


459 


434 


MWPLPWFSISSPSSPPPT*VSSNNHPCA 
VPRAYPGADIPGPGWQRLPPPHRGLSGP 
PESCLLTTILCSKHPNGCPLTQNPRSLP 
GPRPCLIYSTPSPPP*AQL/YAAPSP*L 
YTWREALIR 


5923 


19824 


A 


5973 


681 


1758 

i 


VANKCQI PD I KAKTYCI CNTKSKRS YLK 
T*NLHRGF\ I KKQ I E EFN I G KRHLANMM 
GEDPETFTQEDIDRAIAYLFPSGLFEKR 
ARP VMKHPEQ I FPRQRAI QWGEDGRP FH 
YLFYTGKQSYYSLMHDVYGMLIiNLEKHQ 
SHLQAKSLLPEKTVTRDVIGSRWLIKEE 
LEEMLVEKLSDLD YMQF I RLLEKLLTSQ 
CGAAE E E FVQR FRR S VTLE S KKQL I E P V 
QYDEQGMAFSKSEGKRKTAKAEAIVYKH 
GSGRIKVNGIDYQIjYFPITQDREQLMFP 
FHFVDRLGKHDVTCTVSGGGRSAQAGAI 
RIAMAKALCS FVTEDEVEWMRQAGLLTT 
DPRVRERKKPGQEGARRKFTWKKR 


5924 


19825 


A 


5974 


1232 


980 


SLS LS PRLE C S G VTAHC S LRL P AS SNS C 
FSASQVAGTIGACHHIWLVFVFIjVETGF 
HHVGQTGLELIjT \ * VI C P P WP P KVLGLQ 
A 


5925 


19826 


A 


5975 


1 


259 


L.TSYDYRRALPCLVN\FCVYFIjKREEIiA 
LLPKLFWNS WA* ATLLP * PPKI FGLQAQ 
VAMLRLKFPACLPACLPAYLPPFLFSFI 
PP 


5926 


19827 


A 


5976 


24 


223 


PYLTQISTNESSVCSHARAEFLDHQNQL 
/WLGTVTYTCNPS/TLGG*GGRIT*GQE 
FKTSLGNTARPCL 


5927 


19828 


A 


5977 


395 


136 


GSWDYTHVPPRS ANF / LVETGFRHVA* A 
GLELLGNPPTSASQSAKITAMSHCTWPS 
S YY * CE YNARFGSLHKRSGMT I YIKS YK 
RQN 


5928 


19829 


A 


5978 


1 


785 


GTRLKSGTNTKKALQAVYSMMSWPDDVP 
PEGWNRTRHVIILMTD/GSEGTSLLSQP 
PHL LR P ACGP * VHL * H YTHGWGP E CD S * 
LAVHLS CDPS * GILPKPCDQLSLTLPQL 
AHPAMCIPAFSQFIFLISYPHGPVSSAG 
LHNMGGD P I T V I DE I RDLL Y I GKDRKNP 
RED YLGE * P A*DP APHFLRAWTIi I LP FY 
PSDVYVFGVGPLVNQVNINALASKKDNE 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline 7 
Q=GIutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucteotide deletion, \=possible 
nucleotide insertion 














QHVFKVKDMEWLED VFYQM IGRE IQG2STK 
EPNSPQVPLK 


5929 


19830 


A 


5979 


2 


419 


KAI AP IDSDS SDGSGQJSFKLKT* KEF 1 1 L 
NAI KS I YDSWEE VKI STLTGVWKKLI P I 
LIEDFEVFK\ADLVEIARERELEVEGEL, 
/LQSQDKT 


5930 


19831 


A 


5981 


1 


154 


KKC*MMLTWWCMPVG/PTPRAEAGGSPE 
TGNLRL*CTMIMSVNSHCTPAWAT 


5931 


19832 


A 


5982 


12 


311 


RKS LNF VEGCN I RIjNKCRHTNF VPQT I F 
FFLL FL * LGQ C * *WVFSFFFFFFFCFV 
IiFFVFVCWFVFFCFCFF/ gclfwfll 
/YFYELFCIVI IMCYYLR* YV* IVYMLF 
FVCWI FVLYLCFHY*LFLFFFVFVFLLF 
I CGVFWFYEL FC I VI IMCYYLR 


5932 


19833 


A 


5983 


3 


203 


IGQKRASEDTT*GSAD/PKKSSAGPKRD 
ARQ I YJNTP PS GK YS SNLGNFNYE R S LQG K 
YRWPRLGHTRKS 


5933 


19834 


A 


5984 


368 


138 


STVGKMLSGQHPCYRE/IL/REQKSQSM 
*QISLLSYFKKLPQPFQPSATTALISQQ 
PSASRQNPPPAKRL*LAEGSYDH 


5934 


19835 


A 


5985 


2 


155 


LANF* IYFC/RDKCLPVLPRLVSNSWPQ 
VILPPWPPKVLGLQGMSPCAPRHS 


5935 


19836 


A 


5986 


325 


403 


FIYYLFFLSSLYLI ITYALFLSSF*R* V 
VYHMVMVLLKWI CHHFLTLFFS FRSL I 
LYLFFILFM/YLLIFVLFFIQFSFYLFF 
FYLFYYLFFNDYCFDLIFFLCLIYYYHF 
QSYYFIS * FIYYLFFLSSLYLI ITYALF 
LSSFSI 


5936 


19837 


A 


5987 


81 


651 


KLQ VAGCRTLS Q ID VKFKPS VHS S LAVC 
LRANYFTSQPNVIMQVTLLTRCVAHSKH 
LGTVSYSSWYIKLIiFPKSHFIYLFILRR 
SLAPVAPGVQWLDLGSLQPPPPGPKRFS 
CLS/LPR*LDYRHPPPRLANF\ *FLVEK 
G FCHVG QAGLE LLAS S DL P T S AS QNAG I 
TGVT PAP S PQ KATLNL I LGGQ FH I 


5937 


19838 


A 


5988 


301 


341 


FCFCPCKNRVSLW*WHAPPCPANF\*LF 
IYLFLVETGFSHVGQASLKLLTLDDLPA 
AASQSAGIPGMSHHDGLFL 


5938 


19839 


A 


5989 


71 


1252 


TKEQRSLRSQQLGVRQQRSHRLKFGGGG 
AP S ARG VGGS WS LATRLG P VS TANMSRP 
VRNRKWDYSQFQE SDDADED YGRDSGP 
PTKKIRSSPREAKNKRRSGKNSQEDSED 
SEDKDVKTKKDDSHSAEDSEDEKEDHKN 
VRQQRQAAS KAA.S KQREMLMEDVGSEE E 
QEEEDEAPFQEKDSGSDEDFLMEDDDDS 
D YGS SK/NEKQKDG * EVQT * KKRKENAQ 
TQTKGYSDAKSSERQRESGSP\QASKAS 
KEKTPSPKEEDEEPESPPEKKTSTSPPP 
EKSGDEGSEDEAPSGED*K**WSGERFY 
*KKKEKKRKKREEKKEPT*DRTWFWLWL 
DSWGFQCFFPFVESNISLSLSFFFFFFK 
ANHCMCKCLSYLFVYWSLCQPSPFPMKA 
MSN 


5939 


19840 


A 


5990 


630 


316 


RWNVNSVA\QAG\ IQWCDLGSTGNPRPP 
GF\K* FS CLiSLLS S WDYPTHATMSSKFL 
YF*WRRGFTMLARLVSNWLVLNS *PSDP 
PAS AAQ SAG I TGMS HRAQP I YL S 


5940 


19841 


A 


5991 


2 


182 


WQEQKAE / CLRDVWTAMCTAALLTIAKG 
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E=Glutamic Acid, F=Phenylalanine, 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














R\NIPCPMMKEWVRKMGPIYTFEYYSAV 
IMQVILKF* * HAT TGMNPDD VISE VCQS 
QKDK* RILLS CNNAGNS 


5941 


19842 


A 


5992 


79 


278 


EAK*RVNGCS KPTWHMYTYVTNLHTYV/ 
IKPAHCAHVP*KLKYKIIITIIK* *RWG 
LTILPRLVSNSWAQVILLPWPPKILGLQ 
A 


5942 


19843 


A 


5993 


22 


274 


AGDEHRAWGYQPAM* ARLEMNTGPGDT / 
MPAMAS QRAGITGVSH * GCPFFS LLNKI 
WRSH*VAQAGEQRLFRE*TPGLSCYSCS 
LPSN*DYTSAPLCPPYDT 


5943 


19844 


A 


5994 


347 


195 


YMHTKHTHTHTHTHTHTHTHP \EKHLKA 
PNEV*KSKNNV*IKPKQSQSCFS 


5944 


19845 


A 


5995 


1 


122 


RYELEDELHLQMTTQLHALKENNYQLR* 
SSHVSIPST*GYRHVPSCPANF/SSLTM 
FPSLVSSCWAQVIHLPWPYKVLGL*ALK 
ENNYQLRTI I IDCTLNNNIF 


5945 


19846 


A 


5996 


365 


229 


RPPPCPANF \ * FLVETGFHHVGKAGLEL 
LTSSDLPAPALHQVFYA 


5946 


19847 


A 


5997 


357 


17 


I LVDKIKNVWNLFQNNLWC * RGWGYK* 
DRSFHLLKTVVAE*WVYEGSLY*SLSSY 
YLF/C*RQGLTMLPRLVLNSWHQAILLP 
WPFSLLLCILEVFQNRELKKKNQGKPMT 
GS 


5947 


19848 


A 


5998 


1 


207 


PILYNLFQKIEAEGILPNSFYEASVTLL 
PKPDKDITR/ITTY*FILLT*NILNKVL 
SNQIQL YI KRITY 


5948 


19849 


A 


5999 


603 


269 


EDRVLVLLPKAKSAVAWPRLTATSASW \ 
VQRSFHPSLPSRWDHRHVPPHSANFCCV 
CL*RQGLSLLPGLVLNSKTAQVILPPWP\ 
PKVLGI TSMSHHARPLS FYYGHFKYIQK 


5949 


19850 


A 


6000 


3 


150 


DYRHAPLHLATFKK/ FFCRD/RGLSVLP 
GLVSNSWPQTVLPP* PPKALGLQA 


5950 


19851 


A 


6001 


365 


62 


AI TAHCS LNLLGS SNPTAS ATR IAGTIG 
EAAMSYF * I S STLTHFS YT / C * LHHVWL 
I FKFFFVDEVYVAQASLELLSSSDPPAL 
VSQNARFTGVSHCTEPV 


5951 


19852 


A 


6002 


537 


236 


DRVSLCCPGWS AVARFQLTATSAFR \VQ 
±SSCLSLPSSCDYRSAPPCPANFLSFCR 
DEGGGL PGL PRLVSNS WAQA I HP PQ P P S 
VWITGVINGTWPKIPLS 


5952 


19853 


A 


6003 


22 


368 


NFFLQNKENIKRKI F *ERKHKLMQRTSL 
LSYFKQLPRPPTTSQQPSTWRQDPPPAK 
R/LRTYKGLDDR 


5953 


19854 


A 


6004 


327 


335 


NQS IFTFNNYPRPF / CCKFKFKN / TLHL 
WADKVAHTCNLSTLEGQGGRIA*GQEFE 
ATLANMVKP * LLCKYK 


5954 


19855 


A 


6005 


2 


367 


WQFLKKLNTKLLYGPVF/ PYLGI YKQEK 
WKHVHAETCTQMFITVLFIIVPK*KQPK 
CPSMEE*ID*M*FVHMLEYRAIKENKLI 
HAT WMNLRNTMLSE I SQTQ I TT CMNLLH 
KMFRKVTETES ! 


5955 


19856 


A 


6006 


1 


176 


PGFKQFFCLSFWTRGIFIEMGFHHVG*A 
GLGLLM S GD P PAS AS Q SAG I TG \ VSHCA 
WTY ; 


5956 


19857 


A 


6007 


1 


289 


EVLTKI P IR/ FFVE I GKL I QDLKDTGPR 
I A* TMLTKKTEMGR I VHS DLVAYY I VAV 
IKMVWYYW*RDRHINQ/WERLEISEIDS 
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QNMFNWVFTKMQKLF 


5957 


19858 


A 


6008 


439 


223 


FFSPGSH/SIT*ARVQW/PHHSSLQPEP 
PSLKQS SHLSLPS S SDYRCAP PHPADLC 
RDWVSPCCPGWSQTLGLK 


5958 


19859 


A 


6009 


1 


252 


MVS IS * P RD P PALAS QS AG I TGMS HHAQ 
L FFLN / CI S/ELISS TVTPGLKESACLG 
LPKCWH YRRE P QH PACT VL YL Q F Q SD FQ 
LK 


5959 


19860 


A 


6010 


131 


514 


PWPYSLSSSGLLSSFEN/VEKWVPEIIH 
\HSLFLGTTIDLRS\EPCTTEKLN\KDK 
YNPLTPQMADMMVCDLKVVE * LECSAFM 
W KRLK\ VFDE ALWAALEED * RS S CRKE T 
STLLQSTPLLVSYALPCTP 


5960 


19861 


A 


6011 


625 


141 


E TGS RF VTHAG VKWHNHD S LHP Q P PRL K 
* FS YFS L L LiS S WD YRH VP PR PANL VYP L 
*RGGPPSMLPQASLEF\LDSK* IPPCLQ 
PS/AKVLGLQVSHRPSPKVTFHQRAKEG 
D WSHWSRGKS I PDRGNS Q CMHRS IPC 
VFEEQKGGQCYCSIESKEETSRI 


5961 


19862 


A 


6012 


302 


42 


GLAMLPRLPGTPGLKQSSCLSLPSSWDY 
KHILS*PAVY/C*FLKNIK 


5962 


19863 


A 


6013 


363 


82 


VSVSLSPSLKTKVNYAVASASFTIAKIW 
KQLKCPSMDKQMK* IWHIHTMEYYSA/X 
KQKIMS YSTTWMNLEDIMLSE I S I SQAY 
KGKHHMIYGI 


5963 


19864 


A 


6014 


329 


99 


H I FGNTYF FQVAS S F I YLFRDS VSLCRP 
GWSAVA* SWFTATSG\FRPSACLGLPKC 
/ WD YRRE P PR P ANVL FS MR VTH 


5964 


J 9865 


A 


6015 


324 


238 


LAMLSRLVSNSWPQAILLPLPPKVLGLQ 
A 


5965 


19866 


A 


6016 


2 


308 


FWKAIAG I EGDS S D KCRQ/S KLKT I WKK 
FI SLDAMRNTHDS * EKVKMS IFWKMIPT 
P RDD F / E GL KT S VE KRTED WKTARQ * E 
LKVDRT * TDEELLLLDQQRK 


5966 


19867 


A 


6017 


2 


457 


FFFLRWSL/DSVAQAGVQWRDLGSLQAP 
PRGFTP FS CLSLPS S WDYRRPLPRPANF 
FYF* * RRGFTMLARMVSIS * PRDLPAIiA 
S QS AG I TDVSHRRAERVI S KQR I VS VMW 
KPLPEIHIPCILNLSLRFNHRSVAGLRN 
SIiI VRMLS ILTHG 


5967 


19868 


A 


6018 


389 


142 


AHMLFAAQGDSS I PMLVAPLFT WKMWE 
QSKLPLADEWSGFW*HIHVTEY/YAIKR 
RETLTHATTSLKALCYVI*ALTKGRTL 


5968 


19869 


A 


6019 


384 


232 


LNLPSSWDYRRAPSHSA/NFSYF*RDGG 
LTML\PGWPQVIQMPRPLKAL*LQA 


5969 


19870 


A 


6020 


333 


84 


GLIDQEFATDTTKSTSLSLT*T/IPSRC 
SLCLRGSSDSPASC/SQPPASVAGITDT 
CHHAHI I FVFLVE TRVHHVGRANTEKPR 
LH 


5970 


19871 


A 


6021 


106 


366 


DTISFFFFLKAPIEPVFIIFIFFYFFFI 
FFFI*FFFLLFLFLFIFYFVF/IFFFIL 
LCLFFFFLVLLFYLLFFF/SLFFLFCFV 
FCFFF* FCYFYFYLFFILFFFFFYLALF 
I FFFFGS SFLFI VFLFLFFLFCFVFCFF 
F 


5971 


19872 


A 


6022 


384 


18 


REKKPGPGTGKPPKKKGGPKFGPKGPKK 
RGEKKKKGGP / NS FFPRGGGTPGGNF*R 
G F P PENQKKRGRKKF FQGGKRGA3STPGI P 
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PLLGP KAEGGF I PGKKKKKKKNSNKKS K 
QTKKALSRTSHS 


5972 


19873 


A 


6023 


3 


191 


LVKGKI RS WEETWVKTW / RFYPGVMAH/ 
TYNPNPLGGQGRWIA*AQEFKTSLGNW 
KPHLYKKKI 


5973 


19874 


A 


6024 


364 


142 


I INGL I IHLHNFKINGHTLS T W YTFH I L 
CVCVY I YI CLC \ C I YVS VCL CVCVYTHI 
QYTY* INKHIC*RLEAK 


5974 


19875 


A 


6026 


338 


102 


WGFTXLARLVSNS *HLVICHLDLPKFWN 
YRHEPPCLALFLIFVGT* *VYIFMGYRR 
Y/C* YRHAMCNWHI RVNMVS VGLNF 


5975 


19876 


A 


6027 


99 


77 


RE I CTPTFTEAFS I EAKGWK* PKQPLTN 
EWINTMWCIHMRGKF*P/YAKTSMKLED 
VFHEISKSQKDKHYYI*FLSPTYVKIQA 


5976 


19877 


A 


6028 


2 


328 


TITYRGAKIRITSDSSSETMQAREE*SE 
IF*VLK/EKKKTH*PKILYSGKLSFKSK 
GEIK/YF* GKQN / LKEF VS S RP VLQGML 
KWLQRE GKL YRS E TQTYKKKE KAS E KE 


5977 


19878 


A 


6029 


8 


241 


GFAP LPRLECSNTNMAHHS LNLSGS S DS 
PAS ANFS I F \ I ETR FGHVAQAVL * FL S S 
SNPPISPSPSVEITGVSHHARP 


5978 


19879 


A 


6030 


108 


362 


CC YLMVTE I S AHQNPAKQGLA/ HTAF P * 
KTFFLKVSWTGTVAHV*KPSTLGG*GGR 
TS*SQKFKTSLGIMRPPSLQIKKKVFKI 
YP 


5979 


19880 


A 


6031 


1520 


77 


ASSSSSSSSVILSS*NIPNYNYIVIFM/ 
CL*IIF*AASSSSIYYVPGICYIYH\IL 
CVCVCICMCIWCICIYSIC*YIVYVSI 
YYIHY 


5980 


19881 


A 


6032 


405 


100 


EGFFLPPRGGGGKR/ PPGPLTPGGPGNP 
RPKPPQKWGKPGPPPGGGPF*WVL>LSPS 
LPGGQVESPGAVSGPRQVPPSGSRPFSP 
TAVGTTVT FLS KKKKKQV 


5981 


19882 


A 


6033 


42 


309 


CDKFFHKASNHIYVSTYIWVFQICVCVC 
VCVCVMCILC*AY/CGCLGIMIWFLFVL 
* CI / CYCCY* S S LFS VI I C I FLMCFVS Y 
ERRFILFA 


5982 


19883 


A 


6034 


397 


2 


KGPPP*GGPDFF\*PRGGPPQKNPFFPP 
GGG P P P P KNGVGG P PPQ KAP FFGP KRGA 
P QKGG P PGGGP P RGE KKS L C P P PNWENT 
PRGLKKGPPPFFFFFFFFFFFFFFFVLR 
STGLRAPFQSVFIVELVEDMS 


5983 


19884 


A 


6035 


343 


2 


LWPPQGSFKTAAPFFFFFFLFISDFITF 
HFLYM\YVFFIPLFLL*SFFCIK* IFYY 
VTF * FLQS FSILFFS WFL \ WLL * SLL Y 
SF*LIRIYFRFILT*FQ*DTETLFFYYS 
IP 


5984 


19885 


A 


6036 


319 


68 


SLGEICAPK*FFPPAQKKGPFQKFPSCK 
FSPSPVFKTRPRAKF*KGPP*GKKFYVA 
NPGKI\GPPKGSFKRPPLFFFFFFFFFF 

F 


5985 


19886 


A 


6037 


419 


191 


R IHT I P I TALFAVAKR/ WQS KYPLVDE / 
MDKI W Y I HTMED * AAYGKKE IMS HART W 
INLKNIMLSEISQSQKDKYFPFI 


5986 


19887 i 


A 


6038 


434 


401 


YGKNFSFSLKV*IFCEGLVLVFFPPKKK , 
/ FFFKNFHRWFPLYVFF*TGRGWFFK 
PPFLEKI FFFLTRVNLGPPRGFF* GAP F 
FFFFFFFFFFFFFFFFFFFFFFFFFFFS 
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*NFYFMN* I FTVI 


5987 


19888 


A 


6039 


12 


346 


SFSFVRRSLTVTQAGVQWCNLGSLQPP \ 
LP PS FK* FS CLSLQSNWD YRCLPS CLSN 
F C I F /M * RRGFS MLARL VSNS * PQ/ CD P 
PASASQSAGITGVSH/SHLV*NA*SFKT 
ESHF 


5988 


19889 


A 


6040 


275 


45 


KD VQHQW KI VQ P L W KQ YG S LKAS TE S PH 
DPSILLLGTYPRKMKT*IHIK\TCIRMF 
AEALFLISKK*KLPKCLLADK 


5989 


19890 


A 


6041 


355 


1 


FGTI KNFFFYI FFFKNYF / FFLYII FFK 
KKVLFFF* KKKI FFLF I PNKKI FWVF F F 
FFFFFFFFFCPRFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFLiVTRHLF 
EMRKGN 


5990 


19891 


A 


6042 


635 


209 


PLNIKKTFFFLRQSFTLVAQAGAQWHDF 
GSPQPPPPRFK*FSCLSLPRSWDHRRPP 
PRPANLVF\DFLVETGFHRVSQDGLDLL 
T / S / GDLPAIAS Q S AG I TGVS HCARP TL 
EFLKVKEMIFCSTTVKAVLDHANFLILR 
FSIT 


5991 


19892 


A 


6043 


12 


348 


ESRYSSICLPFSSVYCSRIISIjQLYSYV 
LLVYILFFF/CFLKREFGFCPPLGVQGP 
GLGSLNPPLLG*KQFSGLIFPGIGNTGL 
APPSPPNFGFFRKKGVSPCGPNRPAISK 
F 


5992 


19893 


A 


6044 


192 


1 


IFHRFCT/HITLP / TTALIOTSr* PWLGTM 
AYAYNPNTL*G*GRRIV*AQAFKSSLGN 
MTRPCLYKK 


5993 


19894 


A 


6045 


338 


42 


WKTAW W / FL KKLD / MN YG * D P AI LL LG I 
Y P KE L KAGT * TD I RTS M F T AE L F AI AKR 
WKQNVDLKNLRQNE CRQF I WAKVEGN WL 
PS WFGEWS I QHRLQAGF 


5994 


19895 


A 


6046 


300 


3 


RLDCHYLHSHTHTHTHTHNPAGIYTHH/ 
HCWNLHTHNPGG I LiGVALDL * MQFLiLNR 
GWS WGGE QMAW P C P QGHVMWLGDPGKM I 
LKPPLWVCQLQGSSYDG 


5995 


19896 


A 


6047 


332 


3 


SLESAGFT* YKMYFYYYYKIK\YIYKS I 
FIILFL*IFFFFDLiFFIFFFFIFFFFFF 

TP IT TP TP TP TP TP TP TP TP TP TP TP TP TP TP TP TP TP TI* TPTPTPTPT' T> 

FFFFFFFFFFFFFFFFFFFFFFFF.LKS P 
KRVIYFGFGTQNPQMFSLLTLKYVLC 


5996 


19897 


A 


6048 


2 


152 


KNLKISWT*WLMPWSAT/WGGSLKPGR 
LRLQ*AVIAPLHSSLGDRARPYIi 


5997 


19898 


A 


6049 


1 


288 


PIIIYVLLLLFFETDSHSVT\RL*CSGA 
FS THCNL CL PG PGD S S AS D S / RVAGVAP 
ACLF* PYRGFSGPATIAFRALDPGLPLH 
PGFSLQRPSCSRGG 


5998 


19899 


A 


6050 


358 


1 


fsifsiiiifdflggflkffaffygwdfl 
rfpffyr*pfgpkssfifphvf*r\wiw 
flllsswflkwsfdpqfflkrfffffff 

FPKGFFYKRVFFFPGFFFFSNXj LiLiSSQG 
vyvui 


5999 


19900 


A 


6051 


308 


45 


ADLSAEAL*TRREWDDIFKVLKEK/NKG 
QPKILYPSKLSLINE/NEIKSFPDKQKL 
REFTTRLVL*EMLKGILHMEAQGQYLPS 
*KHTKV 


6000 


19901 


A 


6052 


2 


124 


I FCVLVETGFHWVAQAGL * LLS SGS P PA 
S/DLPNAGITCVSH*AGITCVSH 


6001 


19902 


A 


6053 


I 


330 


LC* PGWSR\FLTS * SACVGLPKCWDYRR 
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EPLHLVFFFFFFFFT* IYLKGFYLNPRK 
IFKNFFPLLKNPFFGVPPKIQFSPIPNL 
PSLWPNMVFFKRVLGEKI FRGFFPP 


6002 


19903 


A 


6054 


34 


293 


SPILFLLCLRHQSGCHPSWTAVAQSELT 
AASNSWAQAILPHSWDYRHAPPH/LNLF 
GWSHAALPRPDSNF* PQAILLPQPQVLG 
LQA 


6003 


19904 


A 


6055 


319 


217 


YHFT P TGMAI S KNRQKTGDNKCC* GCRE 
I GML VC C * WQ YKM VWLL / WKT VWHSL KG 
IK/L*ELPYDPAIPL*GIQRVSWKPIPH 
SSEG 


6004 


19905 


A 


6056 


341 


151 


VHAGGSPES/REFETNLGNKARLHLYKK 
* ISQMWWCIPAVPATHAGPSLHFNLYYS 
PWLFQCDY 


6005 


19906 


A 


6057 


2 


558 


FFFLRWSFTLIAQAGVQWCNLSSLQPLP 
PRFKRFSCLSLPSSWDYRHVPP\RLANF 
F VFLVEMG FLHVGQAS LE LLAS GDP P S G 
DPDLR*IiAFQSLGITGVT/HHAGQIFIiF 
/CLETESPSVAQVGVQWCNLGSLQPLPP 
\ GS E * F S CLGL P S S WD \ YRHAL THPANF 
\SIFSRDGVSPSWPGWSQMPDLS 


6006 


19907 


A 


6058 


320 


21 


EICMTTNHPSVSNTIW/SKMTLASVYLT 
LLPISKTSKTSGLPASKEVNRYLGTCAI 
RYFVFYFI IMLFV*RQGLTMLPRLVSNS 
WAQVI LP P WP P KVLGLQA 


6007 


19908 


A 


6059 


294 


50 


CYFS LS FGL * Q FLSLNQS F VTLS SHCDR 
DTFE * S WS VI / PG IGVYSS * E * TRGLAM 
MPRQVYNSWAQAILMPWLPKMLEVQT 


6008 


19909 


A 


6060 


616 


354 


ERVSPCHPGWSAVALSQPTAALTSW\VK 
QSSHLSLLNSWDYKCMPPLLANF\KFFC 
RNE/SLLMLPRLTSNPWAQVICPP*PPK 
VLGLHA 


6009 


19910 


A 


6061 


214 


11 


AASTRPSAWQPPLLGSEEPLCPATTPSG 
RCTQQ / LH * ERAMMTMAVL WNRKGGKVG 
KRLRNRLVAMSV 


6010 


19911 


A 


6062 


349 


54 


QSKSAFPKKRINRISDSSEGYGQSKLKT 
FWKGFTILDAIKNI CNSSEEVKIATLMG 
V/ WDDFEGFRTS VKE VAADVC/ ETARQL 
EVEPESVTG*QQSHVQP 


6011 


19912 


A 


6063 


332 


3 


SQPASGQISKVTQSTLRCLKMSVQQVKS 
C YKCTAT YL CAKFNS HE FRS S G VCEN P V 
S YF Y THS I RSK I L / W PGMVAHTCN P SAL 
E S QGGCM T * GQHCETS M VNM VT PRVSRA 


6012 


19913 


A 


6064 


174 


416 


NENELFFCFFFXIFFFFXFFFFFFFFFF 
FFFFFFGFFXFFFFFFFFFFFFFXFFFF 
FFFFFFFFFFFFFFFFFFFFFFFXF 


6013 


19914 


A 


6065 


826 


571 


DGSHSLA\RLECSGVISAHCNLHLPGSS 
DS VASAS * VAGTTGTCHHARL I F / WFL 
VETRFHNLGQAGLELLVIHPPWPPKILG 
LQA 


6014 


19915 


A 


6066 


1 i 


284 


GTSFFF*NRVLFCCPG*RAVPQ\SWLTA 
ALTS WAQGS S *NHRLKP /HMLS FFNFCK 
QGLTML P KGALNS WAQAI LQPQPPM* LG 
LQACT YAWS LGP 


6015 


19916 


A 


6067 ' 


36 


384 


VS KNKP IHLFLF FFS KTKSWS VT \RLEC 
SGVTSAHCYLRLPGSREGAASAL*LSAT 
T WLCKVAL PLGEDLGALCRNP WGAG VKG 
S S RE S P P LGE ARGGE L P WGQ VR I RLWGR 
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GEPAP 


6016 


19917 


A 


6068 


393 


116 


PQPIFPRY*QNKDPNL*KTF*HKYMPKC 
PAK\LIIFIEKGFSRFGQPGL*HLSSSH 
liPALASQSAGISGINPGGRARFPLFFFF 
FFSLLLLSF 


6017 


19918 


A 


6069 


53 


280 


FPRFCFC/LLESESHSVA*ARVHWRRLG 
PG/LKRFACLKLPSNWNYRHTPTHPANF 
CIFIRDQGMTL*PSWS*T\PDIK 


6018 


19919 


A 


6070 


3 


377 


HEEPQGCATVPGEFFKFLWRHGLITYPR 
L VSNS WAQAAL P PQ * METH 1 1 FS IGRLH 
MVK\RQYWSGAVAHAYNPRLLGS*GGKA 
A*AQEFETSLGYVMRPCLHKNLKNSPGT 
VAHPCGSSC 


6019 


19920 


A 


6071 


337 


3 


RESQAFFSKKKKKGFTKMWFEQRCERG 
KGARQQSHLRKS I YV* E IASAKALRRHM 
PGVFQKQPGG/WPGWAHT*NPRTLGGR 
CGWIT*AQELETSLGNPEKPCLYKKYKti 
V 


6020 


19921 


A 


6072 


1007 


628 


FFLIxAHS PHS VAQAG VQWGDLGS LQALP 
PGFMPFSCLSLPSSWDHRRLPPRLANF\ 
F I FL VATG FHR VS QMVL I S * PCDPPASA 
SQSAGTT/ GARPKVF* IFVYMYFGVRKH 
S ILMSMPQHDLFGYKI 


6021 


19922 


A 


6073 


27 


364 


MDLPSVSIFDLRSLWADISRATSTLFWL 
LiFSWNCFLHLLSFNLFLSLGQK*VCCR* 
HIQSCVT*QWRY\WSGLVTHAClSrPSTFG 
SQGGWVASV* EFETSLCNMAKPCLYKKY 
E 


6022 


19923 


A 


6074 


2 


68 


ARACSHGTVALTAS CFPLQLDYKCLQYS 
FVCFFFFETKSNFVT\RLKCNGPISVN* 
NFNLPGLTRSQA*ASREAGTTGTCYHA* 
*ILLPSSTRL 


6023 


19924 


A 


6075 


330 


49 


KRRFAL WQ AG VQ WRD LRS LQ P L P P V F S 
CLSLPSSRDYRCPAS /RPS * FFLELLTS 
GDLPTSASQSAGI IGMSHCAQPPLI YFL 
FQIKVILFFS 


6024 


19925 


A 


6076 


2 


267 


ARGVEVKVGKTQKFLNMIRRTLVHCWWK 
CKLIQPLF/WRILPKVKEELPYHLiAILP 
I,GIYPKQMKSISL\KYICSYI * IVDHFV 
* AILPLGI YPKQMKS ISLNIHMFLYIDC 
RSFCIGTFLLL 


6025 


19926 


A 


6077 


379 


236 


PPPPQTRETPPPGHVHKDPPPPHHPYPP 
PQEPQPRTPVPNTHPGEEQNHELFNSSL 
LEPWW\YYSSLLPPEQA*EPHIH 


6026 


19927 


A 


6078 


2 


290 


RRSLPSSPLEDGRLSIAPGLTWEPPSHP 
PPQ*RSPPPPKKKKKKKKSPPGPKF/HK 
FKTGKPE KP PR VL * RGHWP P FKF F *" LKP 
GLPFF* GTKPPKV 


6027 


19928 


A 


6079 


1241 


1487 


KMYC I FFVE * KPCLSCAIFIRLLKWS * P 
TVCNSISGGPPHG*IMPSIIL*FLFLET 
VFHS V\ TRVGVQWPDHGS LQPGP PGLK* 
SSCLTASQSTQIIGVSHHAWPPSITFEI 
FKRLAYSMS ADGW * KS P PMKLDLGALKK 
KLSFARPG 


6028 


19929 


A 


6080 


333 


1 


LCHHVQLSFKFFVEMEVSLCGPGWPFK* 
FSHLGIiPKCWDYRHDPPHIiAFFFSLS\ * 
FFCRDRVSMLPRLVLKSWSQAILLL* PP 
KVFFLF VCFLRRVS FHHAEP YAVS S C 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, =Stop codon, /=possibie 
nucleotide deletion, \=possib!e 
nucleotide insertion 


6029 


19930 


A 


6081 


2 


239 


ARGKYIYNRDKISFCCPGWNAII/RSLQ 
P * TS ELKRSSHINRPNS WD YRCMPSHLA 
NF * I FRRDGGLASLPRLGTGWLKS 


6030 


19931 


A 


6082 


323 


185 


SACLGLPKC/WDYRREPLCLVSR*VLNS 
* AQAI CLPWPPKVLGLQA 


6031 


19932 


A 


6083 


296 


2 


GER I LEE CI YARSHRAE I TSLHCSLSVS 
K/HSVAQWVKWPDFCSMQSMPPRFK* F 
S CVS WS S WDYGLP PPCAGLH I WRRNVT 
LALQGLPTTSLALV 


6032 


19933 


A 


6084 


443 


1 


QKPK* * KTQTWS S I SLQ SRKMKKKI LS V 
ITFFKT*GEEEEEKEKKRK\KEE*EK*K 
EP S Q KKEVEE VE VEDERRGRK* KVDMEV 
KEEEKDKGEEKSGEEQ*KESAEAAVRE* 
VDS/SELVTFAVLPGDSSEEEQGMMVPR 
A 


6033 


19934 


A 


6085 


328 


1 


SNILIFKAQVTKHIYSLETCSPDYPGIS 
QAIKANFSPCTVG*LPSNWAYRHIjPPCL 

dnf*ifcrg\rsilcrlisnswlqqvkl 
qefpqflgcharfqgagflshclv 


6034 


19935 


A 


6086 


299 


46 


lddlhllslclslfrdrvslcylgwssv 
a* s * l»tva\ * tpvi lslrlk* s shlnlp 

SSWD YRWSNS * PQGLLLPRSPKVHLLL 


6035 


19936 


A 


6087 


266 


48 


GIEIVRQMRSRTGADKVIYIVQSPLLKM 
QKI / WLGTVAHTCNPSNCGGRSGRI T *G 
QEFKQSSHLSLPKCWDYRL 


6036 


19937 


A 


6088 


2 


185 


ARVTMLPRLISNS \GFK* SSHLGLRKCW 
DYRCEPPCPAEISFISSGYLTRSGIAGS 
YCNSV 


6037 


19938 


A 


6089 


520 


297 


LG YRDFLrHLS FP I S / WNYRCLP SCPA/ N 
FFILVETN* FHHVGQAG F EHL T S GD LPT 
SAS Q \ SAE I TGMT YRAQPAAGS 


6038 


19939 


A 


6090 


388 


187 


SQHLGKLSREDCL/RLIVPDQS \GQRSK 
TPSLKK*F*K*AWW*HMPWPAIQEAE 
VGGSLEPRKSKLQ 


6039 


19940 


A 


6091 


188 


329 


TLGLKRFFCLSLR* FIYFFFRDRVLLYY 
PGWKAVGIHRRDYSMLYSLELLASKDS / 
SCLSLRSGWDYCVWLIFFGCM* CKRQDL 
SIIPWQRSLIGQ 


6040 


19941 


A 


6092 


124 


342 


RP AWP T WQNP I S AKNTN I TWA * AARVAG 
IT\GTCQRAQVMFVFLAEMGFCHVGQAG 
LYPKI I ILYDKLSHYKCSHLQMRKALET 
YSGIYSLFTIYLV 


6041 


19942 


A 


6093 


169 


2 


SH* PFTSS TIRS I WT \ GAVAHTCNPS S W 
GE*GGRMA*DQKIETSLVNMTRPHLYPR 
A 


6042 


19943 


A 


6094 


2 


324 


FFFLRWS L / DS VAQAGVQWRDLGS LQAP 
PRGFTPFSCLSLPSSWDYRRPLPRPANF 
FYF* *RRGFTMLARMVSIS * PRDLPALA 
S QSAG I TGMSHRARP AVY ILTSTSN 


6043 


19944 


A 


6095 


76 


340 


LRKKFCFSVYPFIHRWTFELLPLNDFLM 
Nb xj?K1 Vi?Rl? jWK.KLyK\WPGAVAHAYNP 
S TLGG * GGR IS* AQE FKTS P GN I VRD F I 
STKNM 


6044 


19945 


A 


6096 


349 


84 


FLLSLF*DTISLCHPGQSAWRTMTSQL 
KQSSHRSLLSSWDCRCVPPCPWFFSFC 
RDE/SLAMLARLVSNLWDQALVAEAGVS 
VWARQE 


6045 


19946 


A 


6097 


346 


184 


LTMLPRLVSNS \GLK* SSHFGLPKCWDY 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine 5 
G=Glycine, H=Histidine, l=Isoleucine, 
K=0Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T==l nreomne, V=Valme, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *—Stop codon, /=possib!e 
nucleotide deletion, \— possible 
nucleotide insertion 














RHEPPCLAKISLYITKFFHILSLPI 


6046 


19947 


A 


6098 


378 


250 


FKQFS CLSLP I S WDYSREPPQMANS \ * F 
LVETRFHHVYSYVP 


6047 


19948 


A 


6099 


3 


225 


GFKRFSCLSLLSSWDYR/HPAP YLANF \ 
*FLVEMGFHHIGQAGLELLTSGDPPTLA 
SQSAGXTGVSHHAQPLI IF 


6048 


19949 


A 


6100 


1129 


457 


YCVS S KYRR IISFSLIA * LLLTKLGS / V 
QKVGVQWRDLG * LQPPP PG\ FKLFS CLS 
LPSGW\DYGHVPPHPA*FCVF*WEMGVS 
PCWLRLV\LNS * PSGGSRPPSASQSAGS 
LQGLI \HRARPEH* F*TS * PLGEGKTLD 
E I Y I KTKTFYQ F C I VCLKLH * RLMEFAL 
VAQAGMQWHDLGS LQPLP PQ FKWFS CLS 
LPSS*DYRYWLLYHPALELKEAPSFPES 
PCTH 


6049 


19950 


A 


6101 


3 


296 


E 1 1 Q ALL I T I VLGL YRTLLQAS * YL * S A 
FT I S DG I YGS T FLAA\TGLHGLHVI I GS 
TFLT I CFIRQL I FHFTS KHHFGFEADA* 
YWHFVDED*LFLNG 


6050 


19951 


A 


6102 


1 


749 


RHEGGLFKTILLI *TSLTIGSLALAGLL 
FFTGFYSKNHI IE \TANI S YTNA* ALS I 
TLIATSLTSAYSTRIILLTLTGQPRFPT 
LTNFKENN\ PALLNP \ 1 1 YVAAGSLFAG 
FLITNNISPASPFQTTIPLYLKLTALAV 
TFLGLLTALDLNYLTNKLKI KS PLCTFY 
FSN I LGFYPS I THRTI P Y\ LGLLTSQNL 
PLLLLGLT*LEKLLP\NTISQHQISTSI 
ITSTKK\GILKFYFLSFFFPLILTLLLI 
T 


6051 


19952 


A 


6103 


377 


1 


SKAFGPPGF*APYGLKAHCFPPGFRRGV 
WAPSGFWARP P IG YPFGAL IGAP VWPS P 
GGPPKPGPP\GPFGVPKLGGKRLGIGGP 
FPGSPGFLTTPGSKKKKSTKKGKPQKKI 
TPLDTSCSRTIVM 


6052 


19953 


A 


6104 


1001 


531 


FFLRRSFTLVAQAGMQWCDLGSLQPPPP 
GFKQ FS CFGLLS S TO YRHVP PTLANFS V 
FLVEM\GFSMLARLVSTS*PQ/CDPPAP 
AS Q SAG I TGMSH CAQLPKP I FN YL * AI D 
S VTTS I S LARDWPNS S ENRLLGYMGKPS 
KFKRQEQKMECYRDLAPS 


6053 


19954 


A 


6105 


376 


117 


CSDWSAVSRSQLTAAPNS* \VKQSSCHL 
SLSSCWN/YRCQPRHLANFVLFCYVLF* 
RWALAMLPILVLNCWPQAVFLPWPPKEM 
GLQA 


6054 


19955 


A 


6106 


689 


433 


LCHPGWSAWQSWLIATSPSLV\KPSSH 
LSLLSSWDYRCTSPCPAWFFVFFY*R*G 
STM* PRLLWNSWAQAILPPRPPKVLGLQ 
A 


6055 


19956 


A 


6107 


2 


176 


ARE YPANF F VCLF FL I LCRDG / D FTMLL 
RLISNSWLKRSDRLSLPKW*DDRREPPH 
QAY 


6056 


19957 


A 


6108 


328 


69 


LYI FS FG* ERVLFCHPGWSAVAQSQLTA 
AS TFRTQ VLSL IGR WDMRHVPPHLADL / 
SFYRDGVSPCCSG*SQTQAILLPWPPKV 
LGFQE 


6057 


19958 


A 


6109 


777 


329 


FLEMGSHS VAQAGVQWRDLGS LQ P P PPG 
FKRFSCLSFPSSWDYRRPRPRPA/NFFV 
FLVETGF \ T I LARL VSNS * PRD P PAS AS 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














QSAGITGVTYPRPALTQNFSNIiTFGEEN 
RKPNPEDSCVRYLLLS I RQRS PTALLHV 
ILFQQMIQLVL 


6058 


19959 ■ 


A 


6110 


345 


309 


ASL I FV* LVEMRFLHSGI SLFNSGKNGP 
TAFSQDPPG\SSVSKKAIIIIINKINK* 
IQHVTVIQIEVQWHNHVSLQPLNPELNR 
FSRLSLLSSWDHRHMPPHSANF*NFKN 


6059 


19960 


A 


6111 


332 


3 


RKVLVCPPGLSAVAGFQVPARSAFQVHP 
LFLEAEVAVSPGCAPAFKPGPQGPF*KK 
KEEERKKKGRRKEEE/EEEEEEEEEEEE 
EEEEEGEEGEEEEEFYFLRAGKVKSPRA 


6060 


19961 


A 


6112 


498 


191 


STDRVLLCQKARVQWCDPGSLKPPPPGF 
K*FLCLSLPKSWNYSHAPPCLANF\*FL 
VETG F CHVGKAGL E LLAS S D P PALAS Q S 
VEITGVSHHAWPMQEYFQI 


6061 


19962 


A 


6113 


1381 


172 


RNIDS KAILiS KKNKAGGI TLPS FKLYYK 
ATVTKTA*YWHK/NH/CVD*WNRIENLE 
MRPHTYNF\LIFDEPDKNKQWAMDSLFN 
KSCWDNWPAIWKRLKMDAFLMPYTKINS 
RWIK/ DLtNVKPKTVKILEDNPGSTIQD I 
GTGKDFMTKTPKTIATEAKIHKWDLIKL 
KSCTAKETIKRWRQLTEWEKIVANYAS 
EKGLKSS I YT* FKQI *KK\NHP IKKWAK 
DMNRHFSKEDICGQQSY/VKK/CPASLI 
I REMQ I ETTTRYHIiTPVQMAI I KMSENN 
RCW*GCGEKNMLIHCWWKCK/liKriVQPL 
WKTVW/RIKTEIPFNPAISLLGIYPKE* 
KS / C/ CYKDT CTRMF VAA\ CP S VTDW I K 
KMWYIHTREYYTAI KRNKTDWAWWLMPV 
I IjAL C E AE VGGS LE VRS s r PAW p t w 


6062 


19963 


A 


6114 


3 


155 


HEKKI SQVWWCMPWP/TILWWLiR*EDS 
LSPRKIiRLQ*SYDCTTVLPAWATE 


6063 


19964 


A 


6115 


47 


368 


STPTPSAFLCLAF/YRQRWLCWPDWGA 
V\IQHTYSLPT*LSlLiKQD*PLNLLGT/ 
WSSRRVAACPENQI YRRNGVS /MLPRLV 
LNSWTQVILLPWPPKV/LGLQA 


6064 


19965 


A 


6116 


1 


592 


RQR I FLrLE CGGAI SAHCNLHLPGSSNS P 
AS VS * VAGTTGVRHHAQLI FVI LVETEF 
HHVGQDGLDLL /NLM IHP PRP P KVL* LQ 
G 


6065 


19966 


A 


6117 


1 


306 


LWKTLWQFVLKLNILLPYNPAIVPLGIY 
PNKMKI YVHTKTCTQICIGALFI IAKT* 
KQPTCSSIGEW/IKKLWYIQTMEYYSTP 
ERNEPS SHENTWKNLKCI 


6066 


19967 


A 


6118 


2 


284 


QTGVHWHDQ\NLL*PETPG\SRDSPGSA 
S Q VAGTTGMHQHARLNF F SFSFFLSFFF 
FFSLGKTLGLGLEFFPRWLQMSGPKKIL 
RLYLQNPKNKG 


6067 


19968 


A 


6119 


2 


315 


S R VAG I TG VGHHARL I F VF / CF FFFFLE 
RDFLFVPRVGFQGPNLG*LKFPLPGLPL 
F S GLTL P KTGD * GAP LQPRVN FW I F * KK 
G \ V* YCGPGWFE I LDLRGS P P F 


6068 


19969 


A 


6120 


42 


392 


LQWRNLCSLQPLPPVFKQFSCLRLLSS* 
DYR\HAPPRLG*FCIFSRRIVFLHGWVR 
AGSRISTS * PQDGSAPPQASQLAGITG\ VS 
HHHTWAPHFLFFKNSGCSTQNR*QVHKV 

GDKETS 


6069 


19970 


A 


6121 


2 


202 


VHLSDQNQP KCP * M I EWIKKMWH I DTME 
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YYAAI KKDKFMS F \ ET ITLGE ETQEWKT 
KHRLLSLIRGS 


6070 


19971 


A 


6122 


355 


161 


FIXYLFFFFFFFFFFFFFLFFLSFLFFF 
NVYM*IYMF 


6071 


19972 


A 


6123 


1 


133 


LC*PGWSR\FLTS*SACVGLPKCWDYRR 
SPLHLVFFFFFFFFTL 


6072 


19973 


A 


6124 


3 


212 


HPGLPSSWDYRCT*PCLANF/SCCCC*R 
LGLTMLPRLASNSWTQVILLLWLPKLAQ 
NTSLINNFYIDQMLK 


6073 


19974 


A 


6125 


156 


2 


ALLTHAKNKAL \ WLGAVAHDCNLSTLEG 
* GGWI T*G*EFNIS LGNI VKHHL Y 


6074 


19975 


A 


6126 


345 


34 


HPSTIHRVFCSL*S*SSPQARNNGSCLS 
SEHFG I TGVNHHTWLLFFKKN Y / CILIK 
I ETGS * YVAQAGLELLSSSNPP I SGSE S 
AGFTGMSHGIRPVS PSLKWS 


6075 


19976 


A 


6127 


2 


328 


ARAYLPLCFF IIFFI* VS VI YFIWLL * G 
LI *LNKRYLAHPK\ Y* LNVS YS VS FCKT 
HFFPRHKEKAIIQLGWAHACNPSTLGG 
QGGW IT *GPEFETI LTSMEKPRLYY 


6076 


19977 


A 


6128 


1 


341 


GTRKFSDYRGKPPHPAKNRNLLFSIFFY 
FETLS CS VLLCEVTSSQL/ CNS FHFPDL 
PSSNDPPS SASRAAETTGACHHAQMIF * 
FFVETGPDWS*TPG/LLAVLPPRSPRML 
ELQA 


6077 


19978 


A 


6129 


264 


2 


TAI S ENKVASRS EHRRHRML * LQ VI /LI 
ALQHRSQRDNLSH* KN* KKKMLGEVAHA 
CNPSTMGGQGGWIS *GQQFQTSLSNMVK 
PPRA 


6078 


19979 


A 


6130 


1392 


1157 


LSLPKCWNYRREPPCPALCCPGWS*TPR 
LK* S S PLGLPKCWD YR/R* TTAPGLFF I 
S KSSLRS PAYS CVT VCTLQLSHH 


6079 


19980 


A 


6131 


383 


108 


MRLCSQPHHHVDFSVLETESCCVTQVGV 
QWHDLSSLQSPPPRFKRFSSLK*LGAIL 
LLPSSWDYRCTPPHPAN/LCRF*HGPVC 
LYMFGEKVTP 


6080 


19981 


A 


6132 


307 


1 


LKTLPALCELESHSVSQWRNLSSLQPPP 
LRFKRLS YLSLPSS * DHSHP * AHQANF\ 
C I FSRDRETGFHHVGQENL I ML I VQQS S 
GLGFS I FATLLAVMQL V 


6081 


19982 


A 


6133 


351 


66 


RWSFTMLHRLLLKS *LQ/ CDPPASASQS 
AGMTGMSHHTRLHNLF* TANNLTQSHTK 
FYSFISLTPHFAINVTNYIFLYYTSINI 
VCIFLSFKFYARI 


6082 


19983 


A 


6134 


2118 


1392 


FFCKGK*IFIQCPDWGG/DLCV*QEKIQ 
KKTYHTGKDKYV* L F * KVKP I TKI * RQ I 
HTSGKTDAQLSYKENHDQGH* QAKNFKF 
FLFFWFYLRRSL/NSVTQAGVQWRDLG 
SLQPPPPGFKQF/SCLSLPSSWDYRRPP 
PRPANFC/ 1 FE*RWGFTMLARMVS I S* P 
RNPPTSASQSAGITGVSHCTRQEFKVKT 
G YHFHLL P S G CLFL F Y VF YL YACLL YVS 
VY YPLRQQE SRKI S KI EMFLTT WH 


6083 


19984 


A 


6135 


383 


47 


HLFLNF YY YLSH Y3NTYLY YFLLHL IPSSP 
KIFFFFT*KIIFFLIKFNKVF*NLSF*K 
KI F/ 1 FSYTF* FWPP *NFF*KPPPFFFF 
FFFFFFFFFFFF*QTLFLGGCGYNTKLR 


6084 


19985 


A 


6136 


2 


196 


TRPSILVTALYYLYIFTTTQWG* \LTHH 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














INN I AP S FTRENTLM F I HLS P I LLL S LN 
PDIITGVCS 


6085 


19986 


A 


6137 


700 


433 


FLRWS SAS VTRLE * SGAI SAHCILCLLG 
SS YS PALAS * VAGTTGTCHHAQL IFVFL 
VETGFL.HVGQDGLDLL/NLMICPPRPPK 
VLVLQA 


6086 


J 9987 


A 


6138 


377 


298 


QMKWKLLLLFRYRVLLC/HPWSAVA*LQ 
LTAASN\FGLK*SSCLSL/MLTMLVLNS 
WPQVILLPWPPKVQEL*AIAAYTIKMWV 


6087 


19988 


A 


6139 


1 


199 


LYTTYFVTKTLLLTSLFL* IRTG\YPRF 
RYEQLIHLL * KNFLPLTIiALL I * HVS I P 
ITISSIPPQT 


6088 


19989 


A 


6140 


381 


28 


STPNRIilERQEDPPES *NTNPPGLPRKT 
PSFTKNPKINLARWGGPENPPNPEEKPR 
KVPESPKRKKP*TKIRPLPSSPGDPTKP 
P \ L KKKKKKKKS WP ATQE AE MGGP LKA 
RSLRL 


6089 


19990 


A 


6141 


1 


209 


LKLLASSDLSTLGSQSAGNIVGMSHCTQ 
PKINMGILLKLVYRFNINPVKTPAGLF/ 
TKKKT * KTE I DKR I LKFI WKCTEHR I AK 
QSLKRTKLEDFLLNFKTYYEATEIKTS * 
YRFN I NP VKT PAGL FKKKKHE KQKLT S G 
S 


6090 


19991 


A 


6142 


3 


282 


QG I FliTKRVKNF * RGKFKTL VKKNQGD P 
KKGKNPPGPKIGKNNFGKTPFWAKKI *K 
FNLI\PKKTPPFFFQKLKKTGVKFFWAP 
KGPKGFLSKK 


6091 


19992 


A 


6143 


382 


158 


FFF*DSliAVSAPGVQ\WHDLCSL\RPLP 
PRFRRFWCLSLLSGWD\YRHVP\QYPAK 
ICIFSRDRVLPILAKASRNVSF 


6092 


19993 


A 


6144 


185 


316 


PCLANF* IFFCK/NGGFSMLPTLGLNSW 
AQG I LLPKPLKVLGLSA 


6093 


19994 


A 


6145 


489 


231 


S VA* AGVQWHDHS * LQPQ / PLGLKQ S S C 
liSLSNSWDYRHMAPHPTNFLKKFFVETG 
VLSCCPGWSSTPGLKGSFCLNDPKF 


6094 


J9995 


A 


6146 


179 


1011 


GTILNPQSLPFKPFFFSTLIFLKTVLLC 
CPGW VSNS SGS S CRS LP KCWD Y * HE P PH 
SAL*TFPWPFP*LVKDTSIHSTQLLKAE 
IFSASLDAAPSPTPYIRTHPSSGLQRWL 
IjDAARSPTPYIRMHPASRLQRWCSTCTL 
LPSPSYHTPA*AMTTASSLISPFFPLPP 
PAHFSQCRMTFCLFVL/CLFEMESRSVA 
QGWECSGAILGSIi\QPPPPGFK/RGFS c 
LSLLSRW\DY\GHAPPCPG\NFCIFSRD 
G\VSLCWPGWSRTPDLWHPPRPPKALG 
LQA 


6095 


19996 


A 


6147 


348 


3 


GYHKTINWHCCMSVFANLTCGFNAIPIK 
/ IPAKYFVDPDKLVLKFTWKGKRPRIAG 
TILKE KKVRRL I LPNFKTHYKATVI KI G 
W * W * NNRHNH WNQ I GS PE I APHKYSQL I 
FDKEA 


6096 


19997 


A 


6148 


22 


228 


THRVALFVRT*NWKQPKCPSTGEWLNTL 
CYIHTIEY*YYSAIKRNKLSIHV r KT\WI 
DLKGIVLNENNQI 


6097 


19998 


A 


6149 


406 


279 


RSIQGSSHSHASA/S*VAGITGARHHTQ 
VI IVFLVETGVMALTY 


6098 


19999 


A 


6150 


406 


10 


MRYHFTPDERLKSKGLBCLEYW*GSGETR 
SLIHCWLEYKMSHPLWK* /TWLFPIKL 
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NKNVP YGPAI SLLDIL *EK* KHMSTKNT 
YK* MLQALF 1 1 TKNWLLS I KRRMTKQIM 
VYSNRNYYSTIKNELPIQTRI 


6099 


20000 


A 


6152 


367 


1 


PTPWEAEAGGSFKPRR/LQ/CPN*PPMC 
S CT P AWAPGS P P VS KKKKMQ FHRRMGHV 
S SAWWP CGADVRNLRAGGYVRHRERF PG 
LPWAAGGRDEVIRGLTVCTACGEDHPGS 
HIiAGPRGGGFP 


6100 


20001 


A 


6153 


441 


38 


ILNPGEAITSEKYVQ*FNEMYQKAKRL/ 
QVttRKGPVLLSYNA*PQVAQPVSQKLNK 
LGHEVLSHRPYSLSLSPTDYHFFKCINI 
FLKE KR F YNQQDAENAFQE F TK S * S A/D 
FCTTRINTFISHWEKYVDYNRSYFD 


6101 


20002 


A 


6154 


308 


121 


GESFPAEKTFPKENDYRWPSFFQWSSVL 
/CKIHGLDMLPRLV*NSWPQAILLPWPP 
KVLGLQA 


6102 


20003 


A 


6155 


1 


420 


QNLQPHWKTLRQF/LYKVKHMMLSPAIP 
LLD I T *GRCKPCP YK/ES I C/RMFIEAL 
FI IAKKLKLSKCGSAGE* INKV*DIYTT 
EYYSAIKSNKQLTHTTTWIKLKSITPSE 
* SQT * ETACFTMLL I WHSRKGN I IVTEN 
RSMF 


6103 


20004 


A 


6156 


77 


415 


TLGLKPTSSSHLSLSKCWDYRQEPPCPT 
SEANFFFFFGGEFPFWSPS *LKQFFFVE 
MG\FTMLPRLVLNSWPQAHLILP/ASAS 
QS VG I TG KSHRAP P Q KL T F F F FLGGNF P 
FGPQAGCPGAESNYPEP*TPGIiKAFFGL> 
NLS KCLKYGD S PTAPGKLE I FFFKWGGV 
S P S CAS RVLNPQPR 


6104 


20005 


A 


6157 


436 


206 


LWY I HTME Y YS ALKKNAI LIHTI \ST WM 
SLENMLNEI KQENHKRANI * FHLYE I PR 
KESRMEVNQGLGRRGIWEFTVY 


6105 


20006 


A 


6158 


416 


50 


RFFTHPPAGEFFSPPPPKNIFFPPPP*K 
FWGGGGKASPPQKKFPPLPPPQNGVLCF 
P PHKKK * F FP P P E KT VE PGP LHVGQRP P 
QS\PFLCYPPLSFFLYFCYFIFFPFVFF 
HFLPFFFFFIR 


6106 


20007 


A 


6159 


443 


145 


FFFFSIRFLFWWLYQG*EYLCFFMT* 
* IGVLLFLFEICEFFFFLLVGILFGMFIi 
/YFFATASPGLFFFCFFFIFFFFFFFFF 
FFFFFFFFFFFFFFFFFFCEKS 


6107 


20008 


A 


6160 


322 


2 


TQDKNKRHRARH I YKRARKS ARD * RS L V 
HR*K*KVQKRNEY*PEGTLHTCIALSQG 
NN I * DA/ HSQE * W I TPVI * TLWEAEAGA 
SRGQEFKTSLANTIVKPDAWANAW 


6108 


20009 


A 


6161 


421 


57 


FCEMKFTPGNNAVNI V* PTTKDLEY/ W I 
NI/DKTVAGCEKTDSNYERSSTMGKML\ 
SNSIAWYTEIFRGRKSQLMWQISIiFSFK 
KIi*QPLQYSATIALIRQQPTTSRQDPSP 
AKRLRLPEGSGDH 


6109 


20010 


A 


6162 


333 


41 


GAWACHLHVGAALNAPTPGLSRFHEEPG 
WSCRTAGGGSRLELSVFFVQMGS CHVGQ 
VGFFC I FGR/DRGLAMFSRLVSNS * TQG 
ICPPWPPKVLGLQA 


6110 


20011 


A 


6163 


2 


423 


KINNTKSQCFENlLKIEKTLSNIiTKKE \ 
REDTN Y * Y WE *NMD I TVES TD I KRI IRN 
TT\ETLHKFDNL 


6111 


20012 


A 


6164 


434 


5 


ICHIFFSISKLLNILRPVSKLFFSINFI 
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FGPELYYL* IHFNP* *GKCLSFL>FLQNI 
CIGFFFNSFMQI*SYFLRISLFIPVSPI 
LLG\ FVQ I F IVS INILYLS I * LRLFFS F 
FliSLFIiFFFYYYTLS FRVHVH I VQVS Y I 
RIHF 


6112 


20013 


A 


6165 


1238 


1017 


STGSHS VTQAGVQLHNS SS LQ P AP PRLK 
* SS YLSLLSSWDYRRTPPQPAKT\ FCIFG 
RDKVSPCCPGWSRTPEFR 


6113 


20014 


A 


6166 


504 


210 


FFFKTRSHDQGSLQP*PLGLKGSSCLSR 
TLPHS WD / YRV/ P P PHP AKK YKN I F I FF 
VETEFHHVAQAGLKVLDSSNSPVSASQS 
VGITS I SHCAQPILCQF 


6114 


20015 


A 


6167 


258 


65 


YKTKNTTSIFSLPSSF*TQILTF*NTYY 
YYY/CRDE/SLAMIiPRLVSNSWPQVILL 
PWPPKWELQV 


6115 


20016 


A 


6168 


1 


156 


LARMVS IS* PHDPPAPASQSAGITGVSH 
CAQLFFFFFWRQPRFFLRAGGQGRDLGK 
VKPLLSGVRPFLS * TLPTEGRKS GPPTH 
FG* VLRE * FGGSH\ P VSSLW* QS AGITG 
VSHCAQLFFFFFWRQPRFFLRAGGQGR 


6116 


20017 


A 


6169 


3 


285 


HASAHASGNILSYVGSFISLTAAI \LI I 
IMI * EAFAS KRNVL I VEEP S INLE * LYG 
CPPPYHTFEEPVYIKSRRIRRSLIDQDA 
ARKETQQGAAD 


6117 


20018 


A 


6170 


645 


15 


LPFIYCSPCRNFNSLFWQERDSNNPLG/ 
ISLPHSDKITFNP\YYT\IKDALGLTSL 
HSLLKDLHSFPHETS*GPQAYYSL/QPT 
PLNNPSPHARPEWIFPYCAYHNFSGSGP 
LTNLGGVL \ ALLL \ S I LHP KQ * FPHPP S 
YPNKQRH*YFGPLRPITFIDFLGRRPSF 
I SNRESGGQPVS YP FTI IGQVAS VLYFT 
TILILIPTISLIENKILKWA 


6118 


20019 


A 


6171 


409 


25 


YTTPGGKI FFKSNPGSKLFPPPPF/ PPF 
FSPFSP*KFFFSPKPLNFFGGFFPFFPP 
PKKRFFSKNPPLVFFSPPFKEKVFPFPP 
PIiNFAPPRVFLKAPPPFFFFFFFFFKGG 
E VKWLS EAliDCKS KDRG 


6119 


20020 


A 


6172 


73 


52 


VQ LL KCR I H I F * DKVLLCH P GW S T VARS 
*LTATSRSSCLSPPSSWDYRRVPPCLAS 
F/SF*YIALNN*A*AIILPQPSKVLGL* 
AVAHAYS P S TIiEG 


6120 


20021 


A 


6173 


325 


470 


IFTFFFLLAFI*FFCRNR/SLALLPRLV 
LNSWLQAVLPPWPPKVLGLQT 


6121 


20022 


A 


6174 


2 


228 


TL.EDNSTIPCKTIQSSSCTPWYVTQKAA 
C I TLL S M * P KKLW Y VYT KS CTQM F I VAI 
FI IAKNWKQPRCLSVSE * INKLWYI * TM 
E * YP VLKRNELSSHE KTW / MEM* I QLL I 
ERSQS * KAWCLHKKLHTDVYS SYIHNC 
QKLEATRMS FSKRMNK 


6122 


20023 


A 


6175 


3 


408 


R YLTIoAI RDMQ 1 1 T P VS YHF VP S GD YGQ 
KDJNJNKCW * RG* KMGS LMHC * W * CKMVQP 

lwkaawqflk/dvkrrailflgiyprem 

KTYVHAKTCMHIFTIiALFLYVHLNALIS 
VMDWMLGHPIEEISRDSYSCVCLL 


6123 


20024 


A 


6176 


2 


237 


WVWWL I P WLAAWEAEAKGLLELGS S RL 
* \WTMPLHYSLGNRARLCLKKKKKKKFK 
VFPPQKKKKKKKKKKKKKGGAL 


6124 


20025 


A 


6177 


387 


154 


AWQWRDLSSLQAPPPGFE*FSCLSL\W 



920 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenyla!anine, 
G=GIycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, IMKMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
A—Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, ^possible 
nucleotide insertion 














D YRHKLLTSGDLPAS ASQSAG I TGVSHC 
AQPLSLFVFFLTNQRFVAALS 


6125 


20026 


A 


6178 


1 


239 


FFFFEMECLTVSQAGVQWYNLHSLQPLP 
PGFKQFSC\LSLPSSWD*RVPTSRPAKF 
/ CVIF*DGVSHCQPGWSAVVQPPLH 


6126 


20027 


A 


6179 


325 


50 


RKFPLTWRRVSFFSLLRSSTDWMPTPHR 
H I GG I I C VTQ S TD SN I GL I Q KH PHRHTQ 
KNV* PNV\WHPMAKSS*HPKLTITVYPL 
STWHTYTSL 


6127 


20028 


A 


6180 


157 


338 


S QHFGRPRQEACS S S E VRDQPGQQRE FP 
HL YKN * KE I T Q VW WHMP P E AQD Y / LE P G 
RSRLQ 


6128 


20029 


A 


6181 


3 


273 


HEGQSKLETFWKGFTI*DVIKNIDDL/W 
RGVKI P I LTGVWE KL I PTL INNFE VFKT 
S LEE I TEYWE IGRELELE VEPKDVTEL 
LQSHEKT 


6129 


20030 


A 


6182 


2 


359 


YYPAI PLLGIHPK* KNS I YQMHTCTHWF 
TAVLFT I AKI WNQL * CSSMGEW I KKM / W 
MHTHTHTHIHTMEYYVPIK/NEIISFAA 
T WMELGVT ILTE I I QAQKKTKG I FSHL Y 
VGAKIFDYM 


6130 


20031 


A 


6183 


3 


211 


NPGKLSVS INQDCFPWTPCPFQMGRCPT 
LLF\ S FYGGQAI PL I TPS VGTQ I FRLKM 
/ PEVTHHFHSFP * EPLTAAALNQPSLPL 
SLLVDF/RDRGLFCHLGWSAVD* SHLTA 
ALNS\LVTQLSCPSFLSSWDYRRMLPCL 
GNF* I FRLKMLKS LT I FILFHESH 


6131 


20032 


A 


6184 


707 


384 


LQIFFYNRVKCVKNKSTQNQFYIFSRIY 
MHANRHGKYE i/WII KKMPMREG* AVQL 
RMKWKIKKNTHI KLGWVHGYTTLSTLG 
G QGGR I T * AQE FKT S LGN I ARLHL YT 


6132 


20033 


A 


6185 


720 


363 


FFFWGRSL*TILF* /QTGSHSVAQSGVQ 
WHHHSLLQP*TSGLKQSSHLSFQSSWDH 
RCAP PRVANFFLD FCRDRVCVAQAGLEL 
LASSNPPTVASQSAGITGGSHRTWPFWD 
ILGIMYWGC 


6133 


20034 


A 


6186 


439 


66 


KPKTLGS KKWSNPRVPPGK/ PWGVGPL 
GPGP *MGGSKP/S *NPSPPPPLFKTRFG 
FFPPG*RQGHYSGFLQPPPPGGKTFFPP 
S PPKKWGHKKKI PFQKKKKKERKKRNI I 
GSWTTSETVGNPAPW 


6134 


20035 


A 


6187 


83 


287 


NKMFKI FRFRF/CG FLGQSHSVTQGGVQ 
WHDHS SLQP * TPGLKRS SL PTCWDNRHK 
PPHLVLFCFYYRL 


6135 


20036 


A 


6188 


290 


609 


RRMDD P P F F KYF FE RRVLVCHQG CS * V \ 
K*SSHFSFPCSWDHRCV/PMPHPAN\FN 
FFHTDKSLTMLPRLVSNSWAQVILWPLP 
P*VLGFAGMSPCPQPPSQFCIEAV 


6136 


20037 


A 


6189 


2 


371 


RYIFEKVFAKYSELTNDSLSILSGKSGI 
L W KKRL SLFS*Q*S YKWF S LGA I YCGMH 
x JvKr xj(jlfc>±ij/ b ±i, x ViilGJt'HHVDQGGL 
E LLTS CE TLAS A Y * SAG I TG I S HCAES Q 
STLKKYI PGSTI 


6137 


20038 


A 


6190 


531 


205 


RWRFALVAQAGVQW/HDLGSLQPLPP* F 
K* FICLRLQRSWDYRCAPLRPA\NFFVF 
LVETGFHHFGQT\NLQLLTSGDPPASAS 
QSAGITGVSH*ARPWFIIFTALLYK 


6138 


20039 


A 


6191 


52 


375 


NRCETSFLFFFFEKKFWFCPPGGGEWRN 
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LK*MEP\LPPGLKEFSCLTLRRGGN*GP 
FPPLQENFGVFRKKGVPHCNPG* FQTPD 
PRELARLTSPRGGDGGGRPPPRVK 


6139 


20040 


A 


6192 


3 


158 


FF FLLLNCGL KKE TG / WLG WAHACNP S 
TLGGRGGWITR*ELKTSIANMAEPY 


6140 


20041 


A 


6193 


2 


54 


FF FE TES HS VAQAG VQ W CDLGS LQP PPL 
GLKEFSCLSLLSSWDYRHTPPHPANLFV 
F/ S FLVE TGFRHVGQAGLELLTSGGP PA 
S AS QS AG I TGVNHRAWPE VFKK* RVS FC 
RPGWSAW 


6141 


20042 


A 


6194 


1 


323 


FILP S S WDCMCAP P / RPDWVL YFLVETG 
FHRVGRAGLQLLTAS DLPAS AS * GAG I A 
DGVSLTQCSMLPMLECRGVILARYNLHL 
PAACLGLPKCRDCSLCPAATPSRK 


6142 


20043 


A 


6195 


619 


243 


S PE TGS C S GAQAG VQ WHDLG S LQP * LGL 
K*SSYLSLLSNWDYRCA/PPRLANFLFF 
VQAGS CHLAQAGLELWS S CHLPALVS QN 
AG I TGVS HHARLAVAWKRT P L YGLP SRI 
SNLCVLLKFLFFYLY 


6143 


20044 


A 


6196 


385 


19 


GGH FGAQ E I S P P G VQLF FRLH F LSNWG F 
KRPPWPPG*FF/VFL*KRGFPPFGRWP 
PSLFP/SDLPPPAPPKGWVSGLRPCPRT 
FFFFFFFDKEC/RLSCPANFCIFSRDGV 
SPCWPPWSRTPDLQ 


6144 


20045 


A 


6197 


498 


437 


ENLQSSINLVDKAMAWFERIDSNFDRSF 
TT VKMPSNLTS I INACCGE I FHERKS * S 
IWQTSLLSYF*KLLQLAPPLLATTILIS 
KEP * TSR/QDRLLLAEG* D 


6145 


20046 


A 


6198 


79 


394 


SVKLLALFFFFFGENFCFAPQVELKGPN 
LG*LNPPPPGLKNFSGLT/LPKKWELGG 
HPPPPSKFFFLFF*KKGMAPFSQGWFKT 
LGLNQFACLTLPKGGNLRREGP 


6146 


20047 


A 


6199 


2 


313 


SASHSAGIIGESHCAWPIMPVYSLVLNF 
FFFLEKKPPFVTQLEGQGGNLG/SMEPS 
PSGLKGIFCLGPPKKWELRAPPPCPGYF 
WFF * KKGGLP I RPGRFWKS VP 


6147 


20048 


A 


6200 


62 


732 


AFQQARKGTQ \ GRLS S PAAGALP \ P VLA 
PSGPPLPA*\GSSS\PASVAPYPPPQAT 
G/SPPTPPGELRRGGWSKGGLDHPRNTF 
YCWESSEP/PPN*GQS*APGGVNSEGRG 
Q/LPPSSIARCVP*TPSHVPPHPLAFPP 
SLPHPGPASRAPP/TGSSDLSASAVSAP 
GREGHP * GTRM VP QGAGRQKP GHRGAS * 
GSSCSKRTLSPLSPRQSGFQSRCCFHPP 
RQNHSRC 


6148 


20049 


A 


6201 


424 


7 


KNMKRSPISCVTRDPQIKITMR*HYTGI 
KMANV/RLTGNTKC* *GYGAKGPLIQC* 
WECKI V* PLRKTAWQFSQS /LKI V* /PY 
DPAIALPGIQLC*KCTS\KKNCM*IFIT 
ALFIMAKKKQKPRYPS IGEWKYKLLHL 


6149 


20050 


A 


6202 


371 


144 


RWAL P RRTNF \ * F F VE TRS LHVAQGGVK 
LLGSVSPPISPSPSAVITGVSYCAQPYG 
HPFELSFSLIEYEYIYLDFF 


6150 


20051 


A 


6203 


409 


110 


LRRS S * LCHLGWS AMVQS WPT * FKP *LL 
GLSDPLTSASRVDYRGMPPCPRNFIIFD 
M*FI * CVCVC/RTWGLIVLPRLVSNFWP 
EVLLHPGPSKVLELQV ! 


6151 


20052 


A 


6204 


15 


221 


KASRKMQ I ETTLRYYLS PMRLAKMQML I 
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NTSCW/RKSQETRTLIC*W*EGKMVQPL 
SKGIWPYLIQAEYP 


6152 


20053 


A 


6205 


23 


371 


KVACTCMFITVLFTIVKL*NQSRCPSTD 
DWIKKM/WLCVCIYSSVCVCLYVCrDR* 
IERHDI YVYIMEYYLiALK* /DEIMFSAA 
TRVKIiDAI I * S * ITQNVKCCI VSTHSGN 
LIMVST 


6153 


20054 


A 


6206 


113 


417 


S AVTLLVI YLNDLKT YVHTKTCTQMFTA 
ALFI IAKI *KQLRCPS \LI WIN* C*KEM 
S YQ AMKRS YARNLW S S KKL WRNLKCLLL 
RERSQSEKSAYCVIPTI 


6154 


20055 


A 


6207 


510 


167 


GLGQAISPSLPKSQGIGAYGFGFFFFFK 
R*DLVCCPG\SLNLVASSQSSCLSLPKH 
WDHE CE P P CIiAF FFFLL * RQGLAMLPR I 
L PLL KALAS GGGMAKKKC I R Y F S MK I HL 
QAQ 


6155 


20056 


A 


6208 


418 


40 


S YTHTHTHTHTHTHTHI P * * LTGS / RLQ 
GICRKGSRKHPRGLSQLLP 


6156 


20057 


A 


6209 


1 


258 


SGFLLS*VPSVRDPTGNRTV*LTWQPLP 
APLEF\CPRLSD 


6157 


20058 


A 


6210 


229 


310 


D FSEAQKS RQCGGG I SNVNHPG WIjKP V 
IQHF*DGVLLCCPGWSAVV*SQIjP*LPR 
LK*SSQLSLPR/SWDYRHVLQ\HPANCF 
VFFVEMSLAMLPRLVSNSWALAI LLPQP 
PKVLDDWLEHHTWMIYITYATTTLP 


6158 


20059 


A 


6211 


496 


1 


CCQESFE I LGSINS FGPGLKKGGVKNMR 
PGAGPTLNKRPFSDNPLQPGRRGGCFPF 
VSQRGSQKDRVPPKWFFFFFSF*DRVSL 
WPPGPSARKPSWLPVASTSR\V*QSSCP 
PDWFPGPPTMVSYFFFF\KRQDIiPMLPR 
LV*NFWPQVILLSQPPKVLRLliQARA 


6159 


20060 


A 


6212 


3 


340 


VTAG 1 1 1 1 FMI * EAFASNRKVL I V * EP S 
I * SLQV/ * WRGLLLLGSPPPYHTLEEPG 
YVTSKRKRKESNPPKLASSQPHGLHDFF ' 
KKKKRGGP FKR I LEGAP FGGGWGHREFF 
P 


6160 


20061 


A 


6213 


60 


493 


KSQQNFFCFETESRSVTQAGIQWHHFSS 
LQPPPPGFKVISYL/SLKSNFSGYP*DF 
FCNGPPCS*GPIRFTYYICVIPSSWDYR 
RLPPHPANFCVF\ IKTVSPGWSQTPDLR 


6161 


20062 


A 


6214 


414 


1 


L CNGRPGLL VS P L P LGE ELIPCLPR* KL 
GPPRVL*\KGGPFL*SSFAPFF*EMESH 
SVT\RLECSRAILVHCNLCLPGPSDPPP 
SAS*VAGRIARAQKVEAAVSHDHDAALQ 
PE QHS E TPS Q QTNKNNQ TKLTQK YP F 


6162 


20063 | 


A 


6215 


2 


403 


SLLLPLVQLPLRLCFFFFLKRGFNFGAP 
DKREWGEFG\*LKPWLPG*NNFSGQTLR 
R S GD * RKT PHGGVNL VF * EKTG F SNLGR 
GGLKPPPSGNTPAWPPKGVGITGGTPPP 
PPVFCFLGFCENTNLRAKKKFI 


6163 


20064 


A 


6216 


2 


388 


KMEQTLWKTDWEFLTKLNILYPYSPAFA 
DLGI YPNELKT/C/ CYTNTC I*KS I APL 
F I VARTWKQPGCPLRDGWLNKLGC I QAL 
E Y YS P * KRNEL *NHNMAYKKLKL I FLLR 
QEDRFSLGVQEQFKHNGEM 


6164 


20065 


A 


6217 


363 


302 


NKEFKTTLANMGEPPFL*KRGFS \MLAR 
WLNSLFHDS PPPAS QKAGI PGVS PPPR 
PLFFFFFLLETVSHYVAQAGQLISNYDL 
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KLPS CLSLLDTWD YRH* P P YPVFKLLNS 
TYKAC 


6165 


20066 


A 


6218 


576 


6 


RAVLRGLLVLLLTKHTFFLFPPRQSLAL 
LPRVGVQW\HDL\GSL\QPL\PPG\FKQ 
FS\CLSIPSSWDYRYPPPRPANF\*FSV 
E TG FHHVAQAVLE LLTS S DL PALASQ S A 
GITDLGQSETIHGGSGLDKPPDHKVDKG 
PTEERACLLSYSAGRKYKEYHILPEQGS 
EPPHHGDILSISFQAASHIAQTPPAHT 


6166 


20067 


A 


6219 


44 


496 


GFRVDPRVRPRVRLNS * PRDP PTSGSAG 
ITGV/ SSHHARP*NIFT*KLGISFLVFW 
GFLFLFFVFCFFFFWKGGFLLVAQVEGK 
/WPKFGLIEPSPSGVKEFFFFNPPEGWD 
\ YGLAP P P G V I F W VL I KNG VS F F CRAWV 
QTSGLSWINRLNLP 


6167 


20068 


A 


6220 


1 


421 


QRGRLTAI SRETSFTLVAQAGVQWHDLG 
SLQHLPPGFQGFPCLSLLST\TGNCSVT 
QAGVQWCHLGSL*PLPPGLKPSSHLSPS 
SSWDYRL*KPHQGSD*VCVT\GAASPEP 
NTMTAWNKPPDIRMKETLLLPPSSSEKM 
L 


6168 


20069 


A 


6221 


3 


36 


AYCIVKPS P * PLTGALSALLMTSGLAM * 
FDFHS I TLLI LGLLANTLT I YQ * WRDVT 
RE S T YQGHHTP P VQ KGLR YG I ML F I T S E 
AFFLAGFF*AFYHSRLGPTPQLGGDWFP 
TGI TPLNPL* VPLLN\ TWE * LA* GVS I T 
*AHDP 


6169 


20070 


A 


6222 


895 


13 


ANSWANN* FELATQ I AFSGRIPLTDSHL 
GLLKAHVEAP I \ AGL I VLAAVLLKTGGL 
MLYTASHSFSNPLTK\HIAYPFPVLSLI 
EA*FITSSHLPSDKQDLKS\LMAYSS\l 
SPHSPS*LTPILIQTP\W\SFNGAGQFS 
IIAHGTYHAYITYSA*ANFKITERrSQC 
AC INPAAQGLSNS *LQLIGLFE* LPRQA 
\SLTMGLTPPLINPTGENCSVASNQRFS 
/WTQISLLLPYRDSTILGSQALYSLYIF 
T TTQWGS LTHH INK I KP S FT RENTLMF I 
HLSPILLLSLNPDIITGFSS 


6170 


20071 


A 


6223 


68 


405 


HS YALVYNPVGIRVKVHLSKC I LYTOH I 
SNSSQEVCEMPNLNLKRVPTTLAPRF*N 
IS*GISQLGAVAHTCNPSTLGG*GRW/I 
QQLKTSLANMAKPHLLLKKKKNFPNFPK 
RG 


6171 


20072 


A 


6224 


591 


283 


GTSTQAGVQWCNCGSLQPPPPGFKRFSC 
LNLPCSWDYRRVPPHLANF\*FLVEMGF 
HHVN/QAGLELLT S SDP PALAS QS AGIT 
GMSHCARPRVAF * GEKNHRS L 


6172 


20073 


A 


6225 


302 


173 


GFFSFFLKC*DR/SLTMLLRLVLNSWAQ 
VI LPLWP P KVLGLQA 


6173 


20074 


A 


6226 


417 


81 


TQTRGPPPPPFLGGPPSGPERTVPPPRG 
G VPQTG S RF FLFRQNPG PAL FF KN" / EGG 
F*KKFGPPLPPF*KTPFFFTNRGNPCPF 
F VGENP QAP P VFGG FNP F P RKKKKRND F 
LS 


6174 


20075 


A 


6227 


41 


474 


TDATETFHFTSKH\HFGF*AAA*YWHFV 
DW* L FL YVS I Y* *GSCVPHPA 


6175 


20076 


A 


6228 


503 


28 


AAHAPPRVRKVIGH*AFFTQRCTNSRH* 
1 1 KFPKDKDKARILKGAREK/RTTYNGA 
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PIWLAVNFSVETLQ/A/RREWHDMFQMP 
KEKTFYLRILYPMKISFKHEREIKTFPD 
KQKPRDFISSRPVLQEMLKSWPGARVHT 
GNPTLLGG 


6176 


20077 


A 


6229 


418 


175 


VSPLSPLKILFFPKGLKFWREGGPKCPP 
PKKRVLiPKK/ PPAMF* IPPPKRKKSKLP 
PPRKIWPPQGIFKSPPPQFYFFFFFF 


6177 


20078 


A 


6230 


45 


308 


AC VRLKH * TDN * Q PN I YNQ P / HKS LL P * 
KKKKKKKKKKKKKKKKKKKKKKKKGGGG 
FKKKP GGG P I S WGGE K 1 1 F FF FGGGKKN 
TLGIF 


6178 


20079 


A 


6231 


440 


209 


IQHFRGPSRRGLLGPEARASPGQHRESR 
SFLKMQP\NKKFS*VWWCTPAIPASPKV 
E ARE S LE PGRRRLHLTEALQPG 


6179 


20080 


A 


6232 


419 


2 


S S P P P P GGKF F * KKPGKKKFS P P KKKRV 
FSP P \ PPKKFFFS PKA*NFWGGGGQKFP 
PPKKGVFSKKPKGVFKTPPKKKKKFFFP 
PGENWGPPRVF*KGPPPFFFFFFFFFFF 
FLS FNVNI LFKP VQAPCLT 


6180 


20081 


A 


6233 


448 


112 


QSARITGVSHRTRPMLRTSTPLSATDRN 
AGQKI S KD I ADLNNNEQDHRD I YRNSTQ 
* QQNTHS FQAAIRMKIDP I LGHKKMLTH 
/ CKQSDQTWNKLE I SNKYAKPQKIKHT I 
K 


6181 


20082 


A 


6234 


2 


351 


MDQGII/ SDFESYYLRNTFCQAIVARDG 
DSSDGS GQRKLKTFWKGF S LLDAI KHTE 
DSW/EEIKISTGVWKLIPILVDDFEGFK 
TSVEE / VADVMDLSRELEL* VE PEDGTE 
LLQSHSQT 


6182 


20083 


A 


6235 


3 


252 


FHTYNSLLVCHSNLCYQNSLLFFIIH* / 
CI /HFHS * AKVS I WKHT YTHTHTHTHTH 
MHTHFFQATDRKI CVNVYMFSVAYIHKG 
LQ 


6183 


20084 


A 


6236 


3 


294 


I ITF * LPQLiSGYIEKSTPYECGFDPI S P 
ARVPFS I KFFLVAI TFLLLDLE IA\LLL 
P LP * ALQTTNLPL I VMASLL * III LALS 
LAYEGLQKGLD*AE 


6184 


200S5 


A 


6237 


381 


121 


P VPAS F* FFFFYRDR \ VSLCYPG WSQTP 
GLKQSSCLGLLSHWDYKYDPPHPAFISF 
LLSS SLLWVL *RP WLLGATVPAFLD IR 
EQ 


6185 


20086 


A 


6238 


395 


62 


PFFPIPKFFPWGPNFFF/PLGPPPFLGF 
FPGSFAPPSHFSPPD*NFPRPPKNFPPG 
GDLFPQKKGDWFFPKKRGVFFFIiGGFSK 
FSGGPPFSLGPQKKKRFKIKGPKKFCF 


6186 


20087 


A 


6239 


1066 


761 


S FFLRWS F I LVTQAGVQWCDLS SLQL P P 
PGFK*FSCIiSLPSSWDHRRPPPRLANFC 
I F/M*RWGFSMLARPVSNS * P / CSDPPT 
PASQSAGI TGLSHRAWRLVFL 


6187 


20088 


A 


6240 


390 


27 


GELPTPIYLAPPASPKGGVSGGSPPSRV 
KFFPLEVPPPPPPGC*PGPFFYIWPQGG 
ARVP * QQ I RFFSLRW\ PPRPTRPKNPHL 
QVGPLRICLSPETGGGVIKVFRGSLLKK 
KKRAAASRSN 


6188 


20089 


A 


6241 


366 


1 


KI KNFDRP I/TSKGA* S W/RNLPTKKS 
P E PNGFP G VFCP T FKKKLTP VF / S KL FP 
KF*KEGILPNSFF*SSLALIPNPFYPSK 
KKKKTKKWYRAI SLMMIDEKIH YKY*QT 
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EFRSTLKVLYTITK 


6189 


20090 


A 


6242 


23 


439 


IASGRPFFFFFFFFFCLPPPPGGNPSFS 
FLKKKKKGGGGGENPPNPPGKAPNPQGV 
GVPPFPPPPVFQKNPPPPFFF*KNPPPS 
PKGPPGGPPFFFPPGGGGGPPPT/PYFL 
KKKGGWGGP PFF* KTNPP P RGEITPLVG 


6190 


20091 


A 


6243 


20 


261 


WAQE I LKKFS CLSKLSSWNYRCAPPHVA 
NXFVFFVETGSYHVAQSGLELLDSRDPM 
TGESHHAQPHFC* ILYTGESQRIL 


6191 


20092 


A 


6244 


376 


76 


LLPRLECSGVIKPFSHLSLPSSRDYRCV 
/ PTMPNF*KFFVELKFNSL*R*GLTMLP 
RLVSNSWCQVIPLPWPPKVLQGCATTPG 
LTDFITTFKGYRHGDTY 


6192 


20093 


A 


6245 


222 


233 


KS FYFALI FEKNFCYVKKSQVGLVFFS K 
VKVLVRPFFPMVIKKNQP/FNPL/RFFF 
FFFQDRVLLCHPGWSAW* S KTLS 


6193 


20094 


A 


6246 


2 


358 


EIILTFQSYY*ENTFNEAITAIDSDSSD 
GSGQSKWKSFWKRFTILGAIKNICDL*D 
* V/ KI STLSGVWKKVI PSLMDDFEGF\ K 
TS VEE VIADWE I AR* LELE VEPEDVI E 
LLQSHDKT 


6194 


20095 


A 


6247 


3 


322 


PDHSLLQTPTPGFRRSSHLGFSSSWDYR 
LEPPHSVCVCVCV\FCVYIYFFFFLEKT 
GFGFVPQAGLNLGVKRI FKA* PPGGVE I 
PGLTHTPGP WGGP IKGKGAWPAP 


6195 


20096 


A 


6248 


1425 


1280 


SLDFQGLALPGWSAW* SCLTEASTSR\ 
VKRTSHPSLPNTWVYRHALPRLANSVYF 
L*RWGLTMLPRLGSNS*TQAILQPWPPK 
VLAL * ADEEM KAKGH I S TKWGG QT LNP Q 
PDCGATSVYSLPIRKFQ 


6196 


20097 


A 


6249 


189 


336 


HAPFPVSVISLSCQKRRR\WLAMVAHTS 
NPSTLGG* WGW I A* GEE FETSQ 


6197 


20098 


A 


6250 


2 


300 


FLPSFMGFWPVLGHGRASTVTPSEPLSL 
PFLLPAEPFLFFFFSLFRKRESIFGA/P 
GGLEGGQFNFLE PS PPG I KE I FWPNP PR 
GWELKIWPPPPINFCDF*EKTGFTMVAR 
AGLKL*TGGLEGGQFNFLEPSPPGIKEI 
FWPNP PRGWELKIWPPPPINFCDF 


6198 


20099 


A 


6251 


1 


198 


LTGRL/ S KCE WS WRFDVQLKDLKRWQN 
HLLL S Q QLAC I VLT SAG I MDHKEMR * KH 
TGGKSLGFFF 


6199 


20100 


A 


6252 


11 


214 


SLRLKSSSHLSLPSSWDLRCELPHLAN* 
KKKKKL/CKN/RSLTGLPKLVSNSWV*T 
ILPPRPPRALGLRA 


6200 


20101 


A 


6253 


402 


3 


KKGFVLS PQVEDKDAISGS FNPGPPGE K 
IPPPQPPEEVGPPGGPPQTRGNLGI/LG 
N*GGGGGF*PPGAGKSPAPGPQKG\PFG 
KDKREKPYGPPPKLFFFFIiRQSLCFAQA 
GVQWCDLGPLQPPPPRFKQFCL 


6201 


20102 


A 


6254 


362 


78 


VGVN / MTFFPQHFLGLSG I PRRYSDYPD 
AYTT*NTVSSIGSFISLTAVLIMIFII* 
SAFAS KREVISVS YAS TNL.E * LHGCP P P 
YHTFEEPTYVKVK 


6202 


20103 


A 


6255 


357 


2 


DLSCHS HGE ATQKHS E KCKAS CTHMF I T 
AliFLIAKHWNQPTC/PHTMEYNSVIK/N 
EQTIDTTTTWMDLKCIMRHEQRQS *KNS 
VLVCFHAADKDI PETGQFTKERGLMDLQ 
FHVAGEASH 
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nucleotide deletion, \=possible 
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6203 


20104 


A 


6256 


2 


305 


Q WCNHS S LHSNPGERET VS S L KLPKAWN 
PRHMLLCPTTS FI LRWGFI YVAQAGFDT 
RGHASL/LA*ASPSVGITSKSQRVRPVH 
D I ACFHRE S TMFS AENFN 


6204 


20105 


A 


6257 


335 


3 


GPFQSPPPGVTPISCFTPPKKGGNRGPP 
PRRGKFWI FKKNGGSPG* KRRVLFPDPG 
VPPPRPLKKGGLKGGAWAKIFFFFFLRQ 
SHSFTQAGMQWCNLGSLQPPPPGFK*F 


6205 


20106 


A 


6258 


219 


2 


PTLSSQKDTKAQRLYSFPLTTTKDERIiV 
TLVS/WPGWAYTFIPSTLGGQGRWVT* 
AQKFE TSDENT VKP PS C 


6206 


20107 


A 


6259 


306 


117 


SFELLGSSDPLISAL.SSWDYRCAPSRLA 
NF*FFVCVEMGSSYIVQ/SWPQ*ILLSW 
PPKVLRLQT 


6207 


20108 


A 


6260 


572 


269 


ETDILSVAQARVGGLHLGSLPAPPPGFT 
PFSCLSCSS *DYRRIiPPRPA/NFFVFSV 
EMGF\TVBARIVSIS*PRDPPPSGLP*G 
ALGLPGVSPLRPALCLLLK 


6208 


20109 


A 


6261 


357 


116 


FAQAGLELMGSSSPLASASQSVGITGMS 
HCA\RLLLFFCGGRW*GG/LK* FSCLSL 
LSSWNYRHEPPHPANLCIFSFNTISY 


6209 


20110 


A 


6262 


3 


337 


HERHERHERHEQ S AE ITGVSHHAGGG I G 
LLNF FLFLFFEAGS CT VT * T/GV* WHDH 
GPQ*PQ I PRFK* S SHLSLHS S WDYRHVP 
P*LGLQACATMPG*FF I FCRDP I SSINK 


6210 


20111 


A 


6263 


515 


150 


FLRWSFTIiVAQAGIQWCGVSSLQPLPPG 
FKQFFCLSLPSSWDYRCPPPCPT3STF\*L 
LVEMGFHHVGQAGLELLTS GDL PASAS Q 
SAGITGISHRARPGTLFF*AANGGGKQV 
FLLLRVIiNGL 


6211 


20112 


A 


6264 


1 


296 


GTS FIiPSFLP/LLSFLKFLNFFQ I VSHC 
RPGWS AVA* SQLTAPS I \ FGLKPS SQLS 
RLS SWDYRCPINFF\SLPMLPRLVSNS * 
PQAILPTQPPKFLGLQA 


6212 


20113 


A 


6266 


298 


182 


LHFFEVSKIIKLIRAEAKWCL*VLFTIA 
KI W KLH * * I C K YL S I DEW I KKMW Y I HT V 
EYYSAIGNKE I / PVI VTIWI YMENIMLS 
EIN**QKDKCMIALL*GIYIQ 


6213 


20114 


A 


6267 


402 


234 


PPPPLFKNPPPPPFKNPPPKKKKIIFP/ 
SPPYNIiAPPKEF*KGPPPFFFFFFFFFF 
FF 


6214 


20115 


A 


6268 


125 


17 


KGQAQWLTPVIPALREA/EEAGG*LEPR 
SSRPAWAI 


6215 * 


20116 


A 


6269 


1109 


454 


GDS AGAE GTMENFTAL FGAQAD P P P P P T 
ALGFGPGKPP P P P P P PAGGGPGTAP PPT 
AATAP PGADKSGAGCGPF YLMRELPGST 
E LTGS TNL I TH YNLEQAYNKFCGKKVKE 
KDSNFLPDLPGMIDLPGSHDNSSLRSLI 
EKPPILSSSFNPITGTML/VRLPPPHWP 
VAGAVS S DAYS AS QEEE*AQAQTEPYPG 
SCPPRNTI*FRSQEEEKEKRRGS 


6216 


20117 


A 


6270 


111 


86 


WGITGVSHCAWSRNGTLTHCWWECEKV 
/ QPLRKTV* QFLMLKI \PYQMI * PLLGI 
YIiRKNKAYIHMKTLK*MFSLFLFEIAK/ 
TWKQPKYPSASEWIN*LDSFSLYWLGWS 
RTPDLK 


6217 ♦ 


20118 


A 


6271 


2 


222 


RSCESNSQTSACYKSFYLYPLKS/GGFK 
I LERM * P I Y FL I PYWT I FNQR Y YF Y I W 
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peptide 
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Amino acid sequence (A=Alanine 
C— Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














NLKKTRGIYTVLYKMYKSF 


6218 


20119 


A 


6272 


787 


470 


LPFFFSFFETGSHCWRWERESLGVAQVG 
VQ*CDLRSLQPPPPRLKQSSHLSLPSSW 
D YRCAPP * LDKFFFCI FCRDGFHHVAQA 
GLKLLS SG\NLPAS VS PSAGTTL 


6219 


20120 


A 


6273 


2 


322 


F CMA I AAI DS S S S HG S RE S KL KTLWKRF 
SILDAIKNTCDSWEEV/KVST*TEVWKK 
LIPTLTDDFQRFKTSVEEVTA\ELLWE 
I TRE IELEVELDDVTELLQPHDQT 


6220 


20121 


A 


6274 


175 


406 


KNSLGGPNLTGEGKKITFSL*NFFFSLE 
ALIFVGRLSSFVPPPKIRFFSKNSRNLF 
/ CPVGE FFKTRP PVNFFFS LP * RKS YFF 
SFPG* IWPP*GIFLKAPPFFFFFFFLRE 
RREVSMLPRLMSNSWVQVTPPPWPPKVR 
VLLGIVDVQ 


6221 


20122 


A 


6275 


2 


181 


PRVRGRVGHRKGKKSFVHCWWQWKLL* P 
LWKTVWRVIjKTLK\ IEIiPYYP/ATKLL* 
SYPLE 


6222 


20123 


A 


6276 


397 


47 


PPPSKNFFFPPQKDFFLGGLPKKSPPPK 
K\FFFKKTPKKNFPPPSFFKKPPPPAFI 
KTPLKKKKKFFPPPQKFGPPPKFFKKPP 
PLFFFFFFFFITTAP * KITKS I P YFYFF 
TKTFYL 


6223 


20124 


A 


6277 


2 


264 


ETESHTVAQSGCSGAIIAHRSLNLQGST 
NPPTTAS * VAGTTDACDHAWLMF * FL\ V 
EMGSHYVAQAELELLSSSDPP\PQSPKI 
LGLQA 


6224 


20125 


A 


6278 


3 


204 


I RC YRKI f yerkhqwlqqi s lls yfkkl 

PQP\PPL*ATITLISQQSLISRQDPPPV 
KR**LTEG*DDC 


6225 


20126 


A 


6279 


57 


306 


INKANRDKL/GTIDFLEFLTIKVRKMQN 
TKTESKKRN*ESILHVE*EDTGHVRAAA 
VCRDMTYLGEKLTDEEVDK\ I IREAKYW 


6226 


20127 


A 


6280 


419 


18 


SSSSPKACMFVYEWVCTQKCVWIYEYMC 
VFVYV* LVLMCCMNVCACM\C* SACV*M 
CVICECVCM*GLSVICEGVCM 


6227 


20128 


A 


6281 


459 


235 


FLRNGVS CCHPGWS AWQS * LTAASNS * 
AQVILPS S WDYRP VTSHPAKTHMF YMQ / 
CGNKKMCVTHSI VI FSLLQ 


6228 


20129 


A 


6282 


505 


470 


GVP*GQEFEPSLSHMGRPCFFIiKCKINP 
AWWQKLF*PRRQKFRLTQI/liALAPQPG 
QQKVNF FS / NKKKKKNPNNC * IALR 


6229 


20130 


A 


6283 


206 


374 


GGGGGKKiOSJWGGE KKRGGEKK I FF F * S P 
PRFFFSPPYKKKKFFFPPPFFFPPPNFF 
FYPPPPPFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFLNHFF\FF*LYVIIV 
LFMGHSDILMHVHNV 


6230 


20131 


A 


6284 


431 


269 


YIKGGPPPEIETFLIFFYYIYLiIIFIFF 
FFFFFFFFFFFFFFF*FVCLKYLIF3SrHI 
QRF*TYISIPHFMLLM\YTLHLFTFCIC 
SLFTI I WFYIFV* L* YCCTVKALCNSL 
LVSELDKYIKNNYNCKQ 


6231 


20132 


A 


6285 


298 


44 


YIiFFFFF*VELVFFLFCMFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFEFKNKVSFNS 
CFSLSLFKLYFLKYNFIQLQGNLHFNNI 
I 


6232 


20133 


A 


6286 


386 


1 


ANLHIKVNFWNFTLLNFSFLPVWFGFH 
FLW*RVYLSHRGFYVFFQNFLRIFFFF 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C— Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Giycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V~Valine T 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "-Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SF*VG\FFFLFFLLFFFPFFFFFFFFFF 
FFFFFFFFFLCIAIVFY* IWYISYFTKY 
IEEPRTRGRTRGRTRG 


6233 


20134 


A 


6287 


190 


134 


LEFGRNRFQFSRMILRGSGLQWRNYADM 
VKIAKT* YLRNTFHKVIAI IGGDSCDGS 
GQNKLKTFWKGFS \LDAINGIHNSQEEI 
KMATLTGVWKK* I P VLKDDFEGFRTS VE 
E/V/ TADMVKI AKT \LELE VE P \ EDVT * 
LLERILF 


6234 


20135 


A 


6288 


394 


2 


RRQASLSCGGLHPVRSSRLLCLPKQAWA 
MAGAPPPASLPPCSLISDCCASNQ*DSV 
GVGPSKPGVGYHLLVHRFLSPLEKRSIR 
VGVARFSRCRLSPLSLTRKGN*LP\TPC 
ASRVRQCLTLLQLAHGALHP 


6235 


20136 


A 


6289 


233 


58 


DRVSLCRPRWHVVSRTRLTVASTSR/LQ 
AILPPQPPE*LGPQASANRSPRSPKLTG 
LRG 


6236 


20137 


A 


6290 


385 


48 


TLQYLCTPTSTV* KKKKPENIKYW*QCG 
ANRTLFQ * WQVCKVAQTLWKP VWHYTVL 
NLNTRIVYDPGMPLLGTNNNAYMCCICX 
YQKTFTRMFIAALFEITPK*KMHKYLTV 
VG 


6237 


20138 


A 


6291 


273 


36 


RNGRWTPCGGR* IHPS*GTDPVYLPVPG 
TLS S SQASNSHPLLKAPGSRP \ P WPTDL 
GYKDPRGPPSNPACRRRPSLFHSD 


6238 


20139 


A 


6292 


394 


5 


AKIYFS INPLVFSKQ* FLKNVFPNILHL 
QFFSNINSLMLIKV^ALLQDFPLVHNVY 
NK I CD TS KDTTTL FM YLMF VFKMKTF FS 
LKAYI /WSGTVTHTYNPSTLGGRGMWIT 
*GQEFKISLANMVIASLY 


6239 


20140 


A 


6293 


536 


190 


RRS FALAVQAGVQWRDLGS LQ P P P PRFK 
RFSCLSLPSSWNYRHAPPRPANFVFLVE 
TGFLHVGQAGLELLTSGDPPVLASQK\A 
GMTGVS HRVQ PD * GHGGS RL S S Q P FWE A 
KTGGS P EVRS S RP A 


6240 


20141 


A 


6294 


2 


238 


HEADLGTTQVLTHPSTTAMYFVHYCQPP 
*ILYGTINT*PPVVHKNPIHIKTP/SPC 
LQASTAIMPQLSHIMC\NPKDTPH 


6241 


20142 


A 


6295 


2 


447 


FFFLMGVSLFCPGL / WSAMGAI LGSLAT 
SCLLGVQVILLCLSLPSSGKLQVCAYPH 
LG* FL YF * AEDGGFS PCWPG * VSNS * PQ 
VIHLPWAPKALGI TSMSHCCPACFHCTF 
FRHFRQNPHLNLFFFLEGLGLSPPPPSQ 
PLEGGKGPGN 


6242 


20143 


A 


6296 


2 


133 


AE ERERE RERERE RE \RERERERERERE 
R*VERERGGVPLRTYR 


6243 


20144 


A 


6297 


3 


244 


HD YS S FLAP PALGYMRMPLHPAYF I LF Y 
FI S / F YLFFSQDEVLLC * LGWS QTPELN 
QAS CLGLQKW WD YRCE PQ CLAE CDS 


6244 


20145 


A 


6298 


1 


361 


GTRRS AMLARLVLNS GPQ/ C YLPALAS Q 
RAG I TGVS H * GC P F F S LL KKRWR SH C VA 
QAGVQWFFRG*TPGLSCFSCSLPSN*DY 
KCAPLCPP*DTVF*VRAVSSEVPLTITY 
RGEGAGVWA 


6245 


20146 


A 


6299 


3 


348 


HERHELEEL I KNLFFFFFFGKKI FFFFP 
GGKNGGPF*FIGAFSFRPQGIFLPYPP* 
KW\DWGPWPPCRGDFWFF*KKTPFSILA 
RWLNSPPWGILSPYPPKGVRFPGGTPR 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H~Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PGPV 


6246 


20147 


A 


6300 


1 


436 


VDGEFS CLSLP S GWDCRRAS P CPANF / S 
VFLVGIGFRYVGQAGLELLTSGDLtiASA 
SQSPGITGMSHCTRPRKHH* IANSSSVL 
GDFR I DY FQ I SRKAQWI TLG YNPWAE SH 
LLMGAGLRVLPAPRLM I SQVDLLPATHP 
SSGRRG 


6247 


20148 


A 


6301 


370 


70 


PKG*S TLPAS AS R SAG I TAVI L* SHCTS 
PPM*1»RFKHFTHSSTSTPF/CFFFFGKK 
ILI FVPQPGGKGPNLS * LKPLLSWLRKL 
PCLTL*RGGDYPGFPPPP\SYFCFFIKK 
GV* PCGPGGNS * PAS AS RS AG I TAVI L 


6248 


20149 


A 


6302 


1 


285 


KHFFLFTYLFRDRVSVCYPSWSTWE/S 
L*SWLIAASN/CLGSK*FSHLSLLCHHA 
IaLVDFNFL*RRRLIRQPRLVSNSWAQAI 
LSSQPPKALELKT 


6249 


20150 


A 


6303 


194 


351 


GTF I CI KKP VQ / WLDTVAYTCTPSTLGG 
RDGRVT * AQEFETS LiGNI VRPHVYKK 


6250 


20151 


A 


6304 


354 


211 


YRHMP / S *LADFF I FCRDSLFMLPWLVL 
NL*AQAILLPQPPKVLGLQE 


6251 


20152 


A 


6305 


1 


346 


GTSSLQLISLHSSPTASYPPKHLHTRLT 
ELLSVSLMAHVLSVSPATSS /PILQPHP 
IYPCTWIMFTHSSGSGKPSCQTHTHTYT 
HTHTRMCAHGSGHATLPP*HPLHPRHSA 
YLLW 


6252 


20153 


A 


6306 


395 


177 


WLNTLWY IHTVDYFLAMKRSDIiLIHS P W 
MNLQGIMLSEKRQ\SQR*HW*FYLYNN 
VEMKKS QKWRTVLWLPE 


6253 


20154 


A 


6307 


2 


718 


GVLPYWPGWSRTPDLV/HPPTSASQIAG 
I TGKS HRALP \ KFAFLTS S Q VMLTLLAH 
GTTAWFS KLS * F * EPPMTCAVFSWTG 
SCHINS FRIQTCL * YYSVFTS CFYCSM I 
EKLTTGI S * QKEHELPFSQSSD I FKLAY 
LI FFW/ CWEMESHS VSQAGVQWHNLS S 
LQPLPPRFK*FSCLSLPSGGDYRCPPPH 
MANFCI FSRDGVLPC* PGW\S * TPDLRM 
DPPGLRPFQSAGDLTGMEA 


6254 


20155 


A 


6308 


377 


67 


PPPPKIFFFSQTLFFFWGGCPQIAPPPK 
KFFF* KNTPQGFFFPPPKKKKI FFFPPPK 
FLPPQKFF*KPP/LSFFFFFFFFFFFFF 
FFFFFFFPTI* IFYSMSFLP 


6255 


20156 


A 


6309 


1 


362 


QTYAKMHVTI I FFGVK\LTFLPQRFLGL 
SGMPRRYSDYPVAYTT*NILSCVGSFIS 
L TAL I L 1 1 FM I RE AFAS IRQE L I VEE P S 
INLE*LLGCPPPYHTF*EPGYLKPKRKR 
KESSPPKLV 


6256 


20157 


A 


6310 


2 


411 


VLPTLWEAGARGSLEARSSRLQ*AMIT\ 
HCTPPWAT 


6257 


20158 


A 


6311 


59 


334 


THACGNML S YAGS F I S LTAVLL I T FM I * 
SA\FAS KRKVL I VKEPSINLE *L YGCPP 
PYHTFEEPVYIKSRQKRKESNPPKLVSS 
QPHGLHDL 


6258 


20159 


A 


6312 


3 


22 


KPSP * PLTGALLDLLMTSGLSM* FNFHS 
ITLLILCLIiTNTLSIYQ*WRDVTRESTY 
QGHHTPPVQKGLRYGIIIjFITSEVF\FF 
AGYF\WPFTHSSL»TPTPQI*GHWPPTGI 
TPLNPL* VPLLNTFVLDALGVS IT*AHD 
P 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
KNLysine, JLr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibfe 
nucleotide deletion, \=possible 
nucleotide insertion 


6259 


20160 


A 


6313 


11 


182 


INKMW * I * VRG * YS VI KK\DE I LS FMAT 
WMEVEDILLSKIREAQKVTHPVFSFISG 
S 


6260 


20161 


A 


6314 


3 


421 


I QAS TMAGAP PQAS IjL P C S L I LD C CASN 
S / RGS VGVGP SEP CTGYNLLVCLLLRP S 
EKCS I RVGVT * FSRCRP S P/TRKGHS PT 
PCTSRMR 


6261 


20162 


A 


6315 


1 


111 


DKGM1JVFMPANTASVLWPMDQGVILNFK 
SFYLRN\TFSKYISAIDSDSSVGSGQSK 
FKTF * KG FTNLDA I HD S WEE I KI S T * TG 
VGKKLl\PTFRDDFKGFSTSVAEVSADV 
VE I GREL * LE VKPGDRTELL * S VL WPMD 
QGV I LNFKS F YLRNNI F 


6262 


20163 


A 


6316 


440 


312 


TSSFSLSFF/CQDG/SLTMLFRLVSNS* 
PQ A I L P S WPP KVLGLQL 


6263 


20164 


A 


6317 


5 


404 


PR YM I VKLL K I / E RNK I LE AARE K* HFT 
SKGKAIWI IVDLSSETMEVRKNLHNFQV 
LKEK/NKTWE VRI L YSAKI S FQKKGKI K 
IFSDEGKLR * S FTSRPTLRE * *RKFSK* 
K*NDRRRNLKTFGRKKEYPKELKP 


6264 


20165 


A 


6318 


394 


182 


MFIVA/ VFSKAKEWRQPKCLSTDEWV/ Q 
YM* YIHTMKYYSAI KSNELVIHVAA* IL 
FENTVLSERSHIWHGSHI 


6265 


20166 


A 


6319 


1 


378 


APSDFFLFQSLNKPLNCTHFSSVHNVKK 
TSLT* LNSQNPQFLRDEPNGWYHCL/QK 
YLELDGAYVKK 


6266 


20167 


A 


6320 


286 


314 


NRCCFSLPTQVKCFNLYLGICCVCDWGY 
YRWLNF*NPI F* FILRQFHSVAQAGVQW 
RDLGSLQPLTPRPANFYILVEMGF/TPL 
LARLVLNS*PSLLKYKN 


6267 


20168 


A 


6321 


23 


397 


QKTLNMEKEMNIYIHKM*ITPNRSNIKR 
SSLRHITVKFSKVKDKQRI /MKLAIEKQ 
PITYKETTIRLSEDFSAETL/NARR*WD 
DIVRYWLKKKKKKKKKKKKKRGGGLLKN 
S LGGP ILTGGGKE I F 


6268 


20169 


A 


6322 


411 


2 


TEPPPPGGNF\L*TKPGKKNFLPPKNLG 
FFSPFSPLKFFFSPK\P*ILGGGGAPFC 
PPQKKGFFPKI PRGVFFS P P / YKKKNL I 
FPPPVILGPPKVFFKGPPPFFFFFFFFG 
QSGQVKLKSPKCK 


6269 


20170 


A 


6323 


395 


280 


MLARLVSNS * P \ CDPPGLASQS AGITGM 
SHHVPQVNHF 


6270 


20171 


A 


6324 


3 


359 


QH YL VFRE TQ I KAAVR CH Y I PTGMAKI I 
KTGSTKC * Q* CGAAGTLIHCWEECR\LK 
Q P L WKTAW * FL I * ML CD * T VP FLQ I YP T 
EMKMCVCRMTC\RVFLSVLFI IASSWKQ 
PKYHKLVN 


6271 


20172 


A 


6325 


3 


174 


FFFETGPNFVNQAGGQGHDLG*LQPPPP 
\GSRI FPA* TSRVAGI TGLNHHPHPLGS 
IF 


6272 


20173 


A 


6326 


265 


13 


TAS P I R 1 1 1 TSQD VLHS * AVPS LGLKTD 
AI PVRLN\QTTFSDTRPGV* YGQCSEI C 
GANHS FMP I VLELI PLKIFE IGPVFTL 


6273 


20174 


A 


6327 


32 


259 


C Y YEHL CTS LC VNW YFHFF * VYT * E WQ W 
PKYLSADEWINKM*YIWTVEYYLTTKIS 
EVL/SATTWLGLENIMLSKRS 


6274 


20175 


A 


6328 


1 


228 


PGLNHCSHLS WDHRHALQCPANFG \ YFV 
E TRF T VL P GL VSND P P ALAS Q S AE I TGV 
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C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProUne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














SHCAQP*VCFHYCLTFFSLQ 


6275 


20176 


A 


6329 


346 


1 


L P KKKKKKI S RGL F P FQL P KKKKDNE KS 
BTFSSETMPGRRQWSKIFKVLRGKKQPQ 
QPRIISLAKLFLKSEDFL/ETQKLREFV 
ASRPALQAMLKEVTj*REGK*YRSLFKKG 
RALK 


6276 


20177 


A 


6330 


349 


77 


HTTPKKNICHN/RKKIKLDVLRYNKVDDN 
SKMSGLH*QCPSDECDAGVFMASHKQ/F 
DRHYCGKYCLTYCFNKPEDKAQHCGSLE 
PRSLRPAWAT 


6277 


20178 


A 


6331 


319 


46 


RPRRPETK*ALNK* KDI/ PMCSWKQRHN 
TVKMLIIiSKLINRFNAITIKISTE\FHR 
NKLILAI IWKCKGPRI AKTT* KGIKLEE 
KLYLISRPP 


6278 


20179 


A 


6332 


496 


309 


DRATAF * RQS FTLS L / RAGMQ WHDLS S L 
QPPPPGFK*FSCPSLiQSSWDYRCPSPRT 
G* F \ C I F \ VEMGFHHGGQAGLE LLTSGD 
LPASAS QS AG I TGVSHCTWPN I LNWK 


6279 


20180 


A 


6333 


3 


173 


HHHSRIi/ IFFCS VLGTSSHYVA*AGLEL 
LGS SNP PDLAS QNPG I T YGSH YAWPD I P 
VF 


6280 


20181 


A 


6334 


339 


153 


FSPPPPFFKGFFFPPPPPPF/SF*PLRG 
GGPPFFFGPPGVRPPPSPWSWCSPPPV 
VLASFFFS 


6281 


20182 


A 


6335 


32 


331 


QLLNHS VI VFLLS FDHS YFLRNLSLFLA 
FKLI GVKNS * Y 1 1 LS FRYLLHP * HLLNF 
FFF / CKRQGLP VLTRLVSNS WAQG I CQ P 
WPPKVRGLRAWSPGLSL 


6282 


20183 


A 


6336 


226 


177 


DGVSLLLPRLi\GGAISAHCNHDIiPGSGD 
SPVSAS*VTQLQYYFCICLGLFTFTTRL 
S AYLGTGKLTTAL PNT PGLLDS 


6283 


20184 


A 


6337 


302 


59 


KMLLNTIMCNRE Ih/RKRKS PRVHQTS S 
VSYFKKLPQPPQHSAASTLISQQPSSSK 
QDP P AAK * L * F VE GRL I AF P SNNVF 


6284 


20185 


A 


6338 


197 


28 


AEKVRFSSAVTWHLGSHSL**SKKNL/W 
LGMVDHNCNPSTLGGRGGRIA*AQAFET 
G 


6285 


20186 


A 


6339 


46 


317 


EVPSS ISSQISLFSGDFSS VPLEELSNL 
FFMFCPEHI IVTCERVTSNYS / YYKWKP 
EFSYP*IF/CFKMSLTMLPGLSIiNSWAQ 
VIIPPQPPK 


6286 


,20187 


A 


6340 


4 


276 


Q T TLRGT T VQ I TKWLS S ETMNAS R CEKK 
NL* SAWGWGMWENNYQ P ILYSAKI SFKN 
GVE I KT FTN/ KQK I KN/ FITS I TLV* EI 
LKEVLQAEG 


6287 


20188 


A 


6341 


485 


2 


G VWE FLE T E S RS VAQAG VQ W CD LG S LQA 
PP\PGFTPFSRLSLLSSWDCGSPPPHPA 
NFFF VF F VETGFRHVS QDGLDLMVS I S * 
PREPPASASQSAGITGVSHRARPKHSKI 
LIAGSRRS IQMFPVKFFQKLAWENFQN 
TKDFLNPYSEILLHTSQIHSSC 


6288 


20189 


A 


6342 


2 


339 


EDLRVKL KKE G YS N I S Y I WNHQD P MLN 
SNGSVTWALLQAS * YLCILQAS * ASSQ 
SWTS / VTLAAVSFQR I SNQEHI P VYQQE 
EMQTDVWTIiLNGSKDDFLIYDRCGCLVY 
HL 


6289 


20190 


A 


6343 


181 


12 


RWGLPSGWDYRSMPLHPVNF* 11/ CMRQ 
DFTVLPRLILNSQPPVILLRWPPKVLGL 
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QA 


6290 


20191 


A 


6344 


3 


295 


YWHL S C F * FLA I TNKAAMN I RLL WT CAS 
IFPR* RSRKEMAGS *GS /SLFNFVRNCL 
TIFSHDCTTL/HS/HQQWGGV/PVLPFP 
HQHLL * S I FLVFD I LIG I 


6291 


20192 


A 


6345 


334 


8 


KPSNPGGEGPP*KKGPKVFSPWGKKENF 
S PKKKKKI * CPQKP KGPP WGAPAFKKKK 
NF*NPRGSPQL*KKRGIIGPPTLIPPSS 
QRPPPKKKKKK/GPPKGALLAFGGIKFF 
FFFLEKNFLFFPKGKTLLAPFFKGGLPL 
PG 


6292 


20193 


A 


6346 


68 


256 


WVA\PVTHTCNPNTLGGRRGRIT*AQEF 
KTSLGNIVGPHLYKNLKKKKKKKGEKKP 
GGGFS PL 


6293 


20194 


A 


6347 


385 


60 


KKKKKNRTPYDAAI/PSLGTSSKEFKSG 
SSRDICTPMFIA/AITKRWK*LKCPSMV 
NI / VKKMWFIHMMQY*SAKKKE ILS *MH 
LKDVMLSERMLSQSQKKNFCMILLrWGI 


6294 


20195 


A 


6348 


234 


61 


CHR I EH I KMVNL / WLDWVHT CNPSTLG 
GQDTW IA*AQEFKMSLSNTAKPCLYKKY 
KN 


6295 


20196 


A 


6349 


2 


358 


FLSLGIHFNYCLVTTESKLSKLRLTYFF 
FFFLETEFPFCSPA*RPW\PNLG*LKFP 
P PGSRETPALTLPRTWNNRRAP PHPANF 
GFLRKNGV* FFGPGGFKTLDLGGPTRLW 
FQKGGNN 


6296 


20197 


A 


6350 


346 


27 


KMV* S LWKAVWQFL I KLNIH * P YDSAVP 
LIG/ 1 YPT*MTT* IYTKP * TQMLAAVS F 
IMAKNTRNESWIFrWPIDKLWYIHIMEY 
**AFKRN*LLINTKT*MNLKC*LF 


6297 


20198 


A 


6351 


36 


158 


KWS VP S QE * L Y YL \ RQENRLNS GG IG CR 
EPIWCHYTPAWAT 


6298 


20199 


A 


6352 


2 


146 


RSSHLSLPSSWDYRRAPP/H/LANCCIF 
CRDR/SFTMLPSLVLNS*IQTILL 


6299 


20200 


A 


6353 


74 


303 


EPSICGQIYLSFSIYKIQK*/WPGAVAQ 
ACKTSTLEG*GGWIT*AQFKTSLGNMMK 
PCVFSKKKKKKKPHPGGGGRKK 


6300 


20201 


• A 


6354 


3 


210 


NNKKIWTLKNLKFFRDRVLLCFPDWSTA 
VHSRLTAAS\TLGLKQTSCLSLVSS*DH 
RGMPP CLALKRKKT 


6301 


20202 


A 


6355 


2 


627 


GRVGGFLVPMRDLAEASRGQQGQVKLGE 
MKPSVSCVCGGV*CISKFKLKN*L\WPG 
TVAHTCS P S TLGGQGGW I A* AQE FETS L 
DNMVK \ P CL Y 


6302 


20203 


A 


6356 


584 


166 


RQGHNVAQAGVQWRDLGSLQPPPPGFK* 
FFHLSLPSS WD YTRVPPHLANF\ * FLVE 
TGFHHVGQ AGLE L P TL S DL PAWAS QSAG 
ITGVT/HQARLQLL* KELINPSTLGPTW 
ELLEAWGRCPSWMGDDLALNCFHHFSV 
S 


6303 


20204 


A 


6357 


448 


202 


FTTLVSECI*DIHDSWKDV* ISTLTGAW 
KT WL I P T LVNGFEG \ F KT * VE E VTAD W 
EIARESELEVEPEDVTKLLKSHDKS 


6304 


20205 


A 


6358 


3 


237 


DRSSL*P*PPGLK*SYYLSLPSGWDHKR 
A* PHPANFCLFTF / V* QRGLDLLA* HDP 
PASASQGN* ITGMSHRAGPPLTF 


6305 


20206 


A 


6359 


49 


242 


SSYLSL*P*PPGLR*SSYLSLPSSWDHK 
HA* PHPANFCLFTF / V* QRGLDLLASHD 
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PPASASQSTSITGMSHRVWPPLTF 


6306 


20207 


A 


6360 


508 


223 


FVTQAGAR* LDLS SLQLPP PGFKQFS CL 
SL P SGW Y YRHVP /STPIVF/CI FSRDGF 
HHVGQ AGLE L LTT * S T CLGLRS AG I TGL 
SHRAQQERGFEP 


6307 


20208 


A 


6361 


2 


595 


AIWG* GTHFL I P WVQKP 1 1 FDHRS QPH 
NVPVITGSKDLQNVNI IPCILFGPVTSQ 
LPRIFTRIGEDYDERVLPSITTEILKSV 
VARFDAGEL I TQRELVSRQVS DDLMERA 
ATFGL I LDDVS LTHLT FRKE FTE AVE AK 
QVAQQDAERA/RNSLATAGDGLMELCKL 
EAAEDITYQLSRSWNITNLPAGQSVLLQ 
LPQ 


6308 


20209 


A 


6362 


1 


259 


LE S F YFLLY YYK* S S * PLRLKRL ILF I F 
ETECCPVA\QLECSGTISAHCHLCLQGS 
GNS PAPA YR VVG I TDS FR I THCILTC I P 
SE 


6309 


20210 


A 


6363 


60 


221 


NTTKLGMVAHT CNPS I LGARGGR IT * V* 
EF KTS LDNI VK \ P CTP P KKS LICI KLN 


6310 


20211 


A 


6364 


1 


393 


RPRRPQTGVQWQNLGSLQPPHPGFKRFS 
CLDLPSSGDYRHTPPCPANFSGFS/HHV 
GLKLFDSSDLPWPPKFVGIARHESHRKC 
I*RSRRGRRIADPLER 


6311 


20212 


A 


6365 


19 


85 


CXNS KIMQS SRI TTLCKM VQ PL* RAL WQ 
CLIKLNIPHPYDSAIPLLHFYQCK*RHM 
NPHN\ TFTSMF I TALFI TAKN * KQS KCL 
SIGQWMKK* * CIHTVQ*NSGI * *HE* IN 
SKIMQSSRITTLCKMVQPL 


6312 


20213 


A 


6366 


2 


298 


PPRLAKVQKFLLFFYLVF*DRVTLCHPG 
WTAW* SQDTVTLAHRNLRLPSGWDYRH 
VPQRLADFVFL/CR/DRGFSMLVGLVSN 
SRPQEV*ATSPSQSLLS 


6313 


20214 


A 


6367 


67 


636 


QDNKKGIR YLF VYFRQS FALVAQAGVQW 
HDLGS LQPLSP* FKQFS CLS LPS S *D YR 

HAPPRPANFL/ qflvetefhhvgqagle 
LLTSGD*PASTSQGAGITGVSHCTGPRI 
RFLKTFSHLEAPGGFPEKRQPLRSERLQ 
FRAFPQTAPCAHAYLLGCVSHLEMGCRN 
IDIPELSFFLCPLQCLIYRGKSP 


6314 


20215 


A 


6368 


3 


251 


RSRHCTPAWAPLSTQDHFYVFYFFLHFL 
I GS LF I FACFVSSYF IE FSYCL\YFFVD 
SFGFYRYTMK*NLNSNYFKLSASILPQ 


6315 


20216 


A 


6369 


333 


193 


FKKLVLMAKAKYCKANFFFQGVPVASKN 
/ TGDLWEEPGWSQTPDLR* SALLGLPKS 
WD YRHE P P PRLAN F C I / FLVE TGFQH I G 
QAGLALLT /S /GDP PAS AS GALE * LGGG 
SCL 


6316 


20217 


A 


6370 


36 


369 


RPTRPEMRFHPVGQAG/PRSPGLK*SAC 
LSLSKCWDYRHEPP/ l *PG*FSFFTYIYI 
YFFFLFFSFSFLEEMGFHHWKAGLELL 
C*NDPPASASYSDRI IGVSHQARPFFFF 
FF 


6317 


20218 


A 


6371 


2 


375 


DIQMIDRNIKN/CSTSQSIRKLQIKRIM 

* YHLTS VRMAI TKKTTHKC * RCGKK\LH 
C * WQCKL VQPL * KT IQRF VQKTENNINI 

* PSNSKN/WKSAC*RDICTAMCITT*FI 
RAEHGGS \ PCNPSTLG 


6318 


20219 


A 


6372 


75 


372 


NGNSQKKKKKKKKKGGPPLKKPSGGPNL 
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P PGGAKKS FPLKGGQKKP PGGFLE KNP P 
LGGAPL\GPPPPPKKRPRGKTKP*GG*R 
GKNPAKPWLGKKILKP 


6319 


20220 


A 


6373 


605 


350 


I S TG I CLGMF I AAIi F T I AKL / WKAT * P S 
FNR*KKKMCYIYASYSAIKK\DAILSFA 
ITWMNLEKTIMLSEI SQTHKDKCCMISL I 
CAI 


6320 


20221 


A 


6374 


167 


378 


KKS QGGTN I KP P A I T KYL S QMGG * KKLL 
TGFLKKKSLLGGGAGVTILLT\NRTFST 
KINDPDGGGDHILSQ 


6321 


20222 


A 


6375 


994 


747 


VSLALSPRLECNGVISAHCNLRFPGSNN 
S PASAS * VAGAHHHTQLTFVFLVETGFH 
HVGQAGLELL /NLVI RPPWP P KVLGLQA 


6322 


20223 


A 


6376 


1956 


1592 


HEFEMESYSVTQAGVQ*HDLGLLQPLPP 
RFKQL S CLS LP / RYWD YCLAP P CLANF \ 
* YLVKMGFRHVGQAGLKLIAS S DAP AS A 
FQSAGITGMSHRARPKSAFKIQAI IGEG 
GVSELHRKINI 


6323 


20224 


A 


6377 


94 


402 


KTQELQMKI SHCL I RRVDRKM/ WLCTVA 
HTYYPSNLGGQDGRIA*GLEFETSLS]Sri 
TRPRLYRKKKKKKNVFQAWPLWSNFLRR 
LRREIRAEKSF 


6324 


20225 


A 


6378 


463 


146 


GKLGPPKGSLKPPPPFFFFL*R/P/HPP 
PRGETPFFFKKPKKSPRGGGASPPFPPL 
GRVRPENFPPPRGQGSP*PKRGPTPALP 
PGGQKETLFPKKKKRKKKKKYMRLI 


6325 


20226 


A 


6379 


3 


207 


LEYQVAQTMCIISSRIQLVAEKASVPFV 
DKTVLS \TMGQL*EAKAEDGFFVPG*YG 
LAVSPLKFQSTRP 


6326 


20227 


A 


6380 


328 


70 


RPRRRLGLPRSWDYRHEPLHPDQLLVEI 
MSHYVAQAGLKLL\ ITGVSHHARPH/YR 
F*LGHC*SLVRRPWTISYSFLCLGFPMW 
EVSD 


6327 


20228 


A 


6381 


13 


77 


VIFLLQPPK*SSCFSLPNRQDYRRVPPY 
PANF * FS / CRHS LTMLPRLLF * TGLQAC 
ATIPS 


6328 


20229 


A 


6382 


342 


77 


KGQGFYPHLFFFFFFF*KDSPSLPRVEC 
CGEISAPCNL*LFGSNDFP/ASHAIVGG 
KLPKQRKGAFFS * GGGFFFFF* M* KHSV 
LNSSKN 


6329 


20230 


A 


6383 


1 


208 


RGRVDKIFYLFFETDSQSVT*AGV*WLN 
PGSLQPLPPGFK* FS * PQVI \TCLDLPK 
CWYYRREPLCPVY 


6330 


20231 


A 


6384 


277 


50 


LVLKLGRCKTSS F I * NFKGPQ I AKT ILK 
KNKAGSLT \ LPNFKTNYKAT VI KTV* YW 
HKH * YRHG F I SGLS I L FH 


6331 


20232 


A 


6385 


2 


354 


AAAS TNFRDR / IL/LCHPGWSAVAQS *L 
TAVS T LN * AQG I G \ S SHL S LLNS LD YRY 
APPLLAVLLCCPGWPQNSWPQVILLSQP 
PKALGYRCEPPCPAPLLLNFFFFFKNRL 


6332 


20233 


A 


6386 


2 


188 


FFKY I CY/ C INMYMCF YI FINIYSLIYL 
*IHIYKYIFAYIFINTYL*IYICLYIYK 
YI FIYYR* * I YICLYI YKYIFI YYR 


6333 


20234 


A 


6387 


2 


241 


WATKCR*G*RRTGTLTGG*WDCKASGQL 
*KTFQQFLISITY/PYGPAISLLEIDPS 
E LKT YVHTKTRT * L F I AAS F 1 1 TKW 


6334 


20235 


A 


6388 


142 


429 


YCVCIYIYIFLYLFIFFCTFFTYSSLHL 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine ? 
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LIFSFFFFLERGVCFSPRGGGGGKYLGL 
WRFFPPE * IKKNVYI YA\QQYYSAIKQN 
EILSFAATWMELEAIMLSEIRQAQNDKH 
YMISLICGIKR 


6335 


20236 


A 


6389 


468 


250 


VHRRTPPCPIIFFFF/CKQ*VLPMLPKIj 
VSNSWPQATLLPWPSLTVEFQALTPGWK 
NFKKMFLMVMSWS I KKR 


6336 


20237 


A 


6390 


616 


1084 


LCLCQYAGISVQCPDCVFQFFHNVDSTY 
EDYLRKLKTSLSFFWRRSFALVAQAGVQ 
WCDLGSLQALPPGFK*FSCLGLLSSWDY 
RHPPWHPANFVLLVEMGFIHVGQAGLEL 
IiT S GD P PAS AS Q SAW I TGMS HRTQQ E \ Y 
FLNLKFSTSPGELLCLiK 


6337 


20238 


A 


6391 


415 


200 


SPPPRFFFFCPPKKNFFFPPPRSKNFFF 
*RGPPFFFFFKKTPFF/YFFFKPPFPPR 
/RFKNPPKIFFFFNRGPKNPGGGGGKKK 
PPGPGGFSPLWGPPPPFKKPPPPPKKNP 
PIjFGKKKKKKKKKKRAAPR*NPRARGVF 
FFPPPPLGFWAPG 


6338 


20239 


A 


6392 


3 


117 


KLKPS S H \ D Y * VGE TYGKD I TS RGKDKP 
IAVCKTFSPLS 


6339 


20240 


A 


6393 


3 


204 


DRVSLCRPGWSAVA*S*LTATSNS* \ VK 
QSYHLSIP*GSWDYSCVP\PPLG*LVIL 
LRQPPKVLGLQA 


6340 


20241 


A 


6394 


501 


187 


AGVKWYEHGLWQPP P PGLKRS SHLSLPS 
S*DHRHEYPCPANF*KIFF\VETRSHYV 
AQTS I>E FLDS SNP P TS AS QWAGI \ *GMS 
HCAQPMQTFSLVKIGTNFLIF 


6341 


20242 


A 


6395 


3 


393 


HACGAI MNS KPHAVTQ YHTPLLM * S VL I 
TT\VLLLLFVTILNDGITILLTDRNRNT 
TLFYPSGGGDPILYQHLF*FSGHAGG*I 
L/LLPGFGTISHIVTYYSGKKEPFGYIG 
MV* AM I S I GLLG 1 1 V* AHH I 


6342 


20243 


A 


6396 


101 


135 


FDS P T PG I VPGT WQT I RSLNVYLTRLTT 
S FSLMPS * FLF I FFCRDG/SLTILPRLV 
SNSWPQAILSPQPHKMLGLQA 


6343 


20244 


A 


6397 


395 


187 


SSYLSLLGIWSYRHVPPHPANFLFFCRD 
GWYFIFFC\KMGAMLPRLVLNSWAQVI 

IjIjL * p s kvlglqa 


6344 


20245 


A 


6398 


35 


334 


QNETLS KTKKIKNKKKKKKKKRGGPLKK 
PLKG P KL TRAC KKKP F S LKG / WRKKNNL 
GIFEKKPYFGGGKKWNKPPTKIKSFKEK 
KKF*REKGEKQPQNPGC 


6345 


20246 


A 


6399 


464 


146 


HIQIREFQRTPTRMSPKTSTLSHFKIKK 
QLVMNKRS FI KQWI S * QKP CQMARRD *H 
N I FT VQ KGTNCQ I R \ I L CPAKL S FKISIE R 
EXKPFLGKYKLREFITA*FIFKG 


6346 


20247 


A 


6400 


723 


632 


FFN*DKILLCHPGWSAVARSQFTAASTF 
RVQ\QSACLGLPSSWDHRHIPSCIiATF* 
* KRGFAPLPRLLSNFWAQAILPPWPPDV 
\LGLQA 


6347 


20248 


A 


6401 


522 


46 


FFPPFLSFFFFSKIPKQSLGLVFYF*NK 
VWVFCPGWRAIR*FPFPA/APPLGSKNS 
/SCLNLPSTWGYRISGMVPPKAH* IPPE 
I FFFFFF / CRQSLALS PRL/ WQWQVQA 
ILLPQPPE * LRLQVPGRW 


6348 


20249 


A 


6402 


103 


261 


NI PFTCLI KKKKKKKKKKKKKKKKKKKK 
KKK/IK* KKKKNWGGWKKKLGGD 
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6349 


20250 


A 


6403 


146 


384 


GFLPSLSGHKELSLFFLFETESHPVAQA 
\DLGSLKPLPPGFK*FSC\SLPRSWDYR 
RVPP CSVNFCVS PY* PG* SQTPDLR 


6350 


20251 


A 


6404 


3 


395 


E T E S HS VAQAGVQWRN PGPLQPPPPKPK 
RFPYLSLPSSWDHRHMPPCPANSAFSVE 
TGL/LHAGQAGLQLLTPGDPPT*TSQSA 
RTTGASHRARP S LKYFL ISFVI SSLLHG 
LFRSVLGSMYLRISQVFFCC 


6351 


20252 


A 


6405 


30 


205 


CRLTRPAEFRTRKTHT\REKPYECKQCG 
KFFS CKSNLI VHEKTHKVETMG IQ* VMW 
LFL 


6352 


20253 


A 


6406 


3 


444 


ARKITGY* ISKT IVFLYT/DELAEKEIK 
KAIPFTIVPNK*DN*ELNLTKQVKDPHK 
EDYKILMKETEQDTK/WKDIPSSWIRII 
N I VKVAVL S KAI YRFS AI C I K I RVT FKT 
ECHWKKKIWIGKVAHACNPTTLENQGRR 
SLEARSLRLS 


6353 


20254 


A 


6407 


480 


307 


QHDSPSAKKPPQKPDKS/HPSSRS/HPF 
PPPPPC*TPPPEF*YPPPKGEWGARPPP 
QG 


6354 


20255 


A 


6408 


923 


833 


KAMKHFLIKTRAFFRHVTAHL IDQSIVK 
TELLYVTS F I AI FALLWWYGKE PML /WP 
GAAAHTCNPSTLGD*GGRTA*AQEFKTS 
LGS * NKTKP * CLSAV 


6355 


20256 


A 


6409 


382 


152 


ECSGTLLAHCNLCLLCSSDFP/SLFHNV 
SLSSNSSS*VAEIIGLCHHWLIFVSAK 
LFLGRRVRNLFS PTPKNMFS K 


6356 


20257 


A 


6410 


553 


173 


GLLI IKNTLPAKKKKKKKKKKKKKKKKK 
AR/WPP*K*RAP*NPVKTPGGSF 


6357 


20258 


A 


6411 


38 


562 


FLRQ GL / DS AAQAGVQW CNHS S QRAP P F 
GLGRSSH\PASLAGWSAVA* SRLAATST 
FRVQVILLSQPPE*LGLQSCSITQVGVQ 
QWRDLGSLQPPLPGFKRLSCLSLPSN*D 
YRRLPPRPDSSITFSRKFFQSPQARNLP 
PTPELKPQQPVPAPSPSELHLSWSSGNK 
CHFLWFLCP 


6358 


20259 


A 


6412 


414 


144 


FFFETESHSAAKLE C \ JSAHCNLHFPGS 
S KS PASAS * VAGTTS VC YHAQL I FVFLA 
DTGFYHVGQDGPDLMFCPPRPPKVLGLP 
RLPLICF 


6359 


20260 


A 


6413 


3 


455 


AEAGGLLEPRSSRLQ/ CS QP * SHPCTPA 
WAT 


6360 


20261 


A 


6414 


462 


162 


TQGPRLFPHPFGAPGGPVPLGWEVRPPP 
APPGKPPFPPKPQ\NSPPWGGQPSIPPP 
LEG*ARKSP*PPKGLVPVTPKGPPPPHP 
GQENPTPFPKKKKKKVN 


6361 


20262 


A 


6415 


499 


0 


HFTS KHHFGFEAA\A* YWHFVD W* LFL 
YVSIY**GSH 


6362 


20263 


A 


6416 


187 


236 


MCHINGIMQFFFFFFFENEVFFCPPARI 
GN*R/WPNLG*LKPPFPGLTLFSCLTPP 
RSWNKRPPPPARVI FLFFNKKGVFPLLP 
GRV* IPHTPEPLATTNLLSVTIDLLILN 
VSYKWDHAVFFFFFF 


6363 


20264 


A 


6417 


158 


412 


SSDPSRRSCLFSF/ CFFFLEREFFFAPR 
VEWRGPNFCLLTPPPPGKKHFFAPSLRG 
GGDYGHLPHGPVIFGTLRKAGFSPCGPG 
G S * P P P PGKKH F FAP S LRGGGD YGHL PH 
GPVI FGTLRKAGFS P CGPGGS 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaJanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L==Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


6364 


20265 


A 


6418 


457 


11 


HTGQSPYRCSECQK/AFTQKSAITNHQK 
PQQKGRKSMQ * LR I * ES FFD * SHQLRHN 
RSSMNTMIVMGPFGTLVGEKPYRCNEYF 
KPSFLKISLVYRRDLTRRI IGQAHWPMP 
VISALWEAEAGRSliEVGSSRPAWPT 


6365 


20266 


A 


6419 


103 


376 


SALSLILGITFFF* IFLETGSHFIAQAG 
VQW/PIHTCDHSILQPQTPGLKPSSCLS 
LLSSWDYRHVPPVPRTSACFECTILRSG 
SVAHTVIiWKAKAGGSLEVRSIiKLACATQ 
GAPCLTKKKCDPQD 


6366 


20267 


A 


6420 


463 


6 


FPPNPHKKQNQQTP\PPKKAPQQQKYTA 
DP / QD P P GND P KAQAYKKS PAAT Y P AAS 
SSPSSPSS/RPPSTAAHVLRVPL*GSPP 
GLVGNPRPPPPGGAQHPQPPDTWQKRPA 
HHGLQSLSAS PPGI S * EPQLAPRNSPHW 
TRDL 


6367 


20268 


A 


6421 


1 


479 


RGFAVLFSAVEFPSSSKSHLDLETSPKA 
YKHGAPCSAPGSLQASDPKGRNQRNPFS 
NFFVLF*DKISLCCQGWSAVIQS*LTVA 
S TSPGS DDPPTTP P P PHAPDTWDHRHVP 
PHPAKFYIFYRDR/SFTMLPRLVSNSWA 
QTVCLSSLFMLLPSLLWSLSP 


6368 


20269 


A 


6422 


11 


242 


FEIIFSAVEFGWSDHGSLQTQPPG\SSD 
PPTLDS * VAGTTGLHHHAQLMF * liFVE I 
GSPGLKWSSVSASESEPLYPA 


6369 


20270 


A 


6423 


488 


335 


SCQLRPWNESIGQRSGIRWLTPVIPALR 
EA/EEAGG*IjEPRSSRPAWAI 


6370 


20271 


A 


6424 


454 


229 


RDPPASASQSAGITGVSHHARL/VSLFIi 
IVM*AEGCYLSGIWRTRLKMNPPQ*RKE 
RQIQVKSTRSDI FFFYFLFL 


6371 


20272 


A 


6425 


1575 


1309 


FESESHSVT\RTECSGTIS\AHCSRHRL 
GS SDS PASASRVAG I TGVHHHAQL IFLF 
LVELEFHHVGQDGLELLT\*VIHPPWPP 
KVLGLQA 


6372 


20273 


A 


6426 


276 


437 


CFHPPEPWDQQWGPPPLGNF* IFWKKRG 
PPFVPQIILNPWPQKNFFAWPPKTLG 


6373 


20274 


A 


6427 


445 


240 


RPRRADHL/RSGV*DQPGQHGETLRL ) RK 
IQKISWVWWHAGTLWPFWERDGSHRLSA 
RTQMLGSWACSPGF 


6374 


20275 


A 


6428 


114 


450 


LLLLLLFESESCSVAQA/PSAVL/WILL 
GSL*PLLPGFKQFLSLSLLSIWEYRGLP 
PYPANFFKFLVEMGFHHVCQAGLELLTS 
GDLIiASAS QHSGI TDVSHRAQPVT I FLK 
GLS 


6375 


20276 


A 


6429 


380 


224 


LRHCLFTQAGIQRQEHGSL * P * S PGLKQ 
SSHLSLPSSWWY\GVKLPCPNKFQH 


6376 


20277 


A 


6430 


509 


228 


PHNKGRVSPPPPQNFFS PPSRNLGGGWL 
PKIPPPKKRVFSKKPPRGKKK/PPP*KK 
KKNFSPPPFFGPPPKIIKKPPPPFFFFF 
FFFFFFFFFFFF 


6377 


20278 


A 


6431 


3 


117 


HS WES I * KRGACFN I F I AAQ FT IAD I WN 
* FMCL S GD E * I KKLGL I YT/MD C YWGVK 
KKETMPLATTWIELKVNILSEIGQA*KD 
KYHVF*MNR 


6378 


20279 


A 


6432 


417 


271 


LS KKTKFRNS PQPPLPCSFL *NDRK\ WP 
RVWHTCNPSTLGGRGRWIT 


6379 


20280 


A 


6433 


385 


145 


TPGIKQSSCLHLPKCWDFSCEPPHLANL 
NF / CKR* Y I PIVPNSKV* TFS QEVFSYT 
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SEQ n> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eih 

od 


SEQIB 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AJanine 
C=Cysteine, D=Aspartic Acid ? 
E=Glutamic Acid, F=Phenylalanine/ 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














WPHS S S FTGKE VI I LFPKE C PGMFK 


6380 


20281 


A 


6434 


65 


383 


LFPNLLKKKKKKKKKKKKKKKKGGPIKK 
KP*GDKIKTRQKKKNFFPKRGGKKNQPG 
KFKKPCNJSFFWGGKKMEKPP \KKKKTCKEK 
KKI * KKKGEKKPQ I APEKKKGS 


6381 


20282 


A 


6435 


3 


454 


EIQRTPVRYYTK*TLPRHIVTRLSKSTL 
K/EKKKKKKFLRAPTEKGQI PPKKIP / I 
KLT VNF * KKS * RRE KWG\ P I FN I LKKKK 
F* PKI F YPAKLTF IREGKKKFFPKKQGL 
GKFVTTRAPLKKI LTGVLNLKMKKGYLF 
PQNTLKYLAHNHP 


6382 


20283 


A 


6436 


424 


410 


NFFFPPQGGGGGGFFF*DKPPPGNPSSL 
FFFFLGGGKKRGPPPPPKG/ QTPPPGGL 
F*KWLGGKKETPKKFSPPPPFFGGGGKI 
FF*KPPPPPGFFF*KSPPGGGGPPSVKK 


6383 


20284 


A 


,6437 


3 


214 


E I DCCAAVWKQ P KC P SADE * RKKMWYL Y 
TMEYCS ATNKNE I QS F / VNNWKE VE I I M 
LS\KKSQAEKEKHGIF 


6384 


20285 


A 


6438 


83 


420 


QS FP * I S CRKYS / WLGAAAHACDANTLG 
GRGGRAAWGQKF KT S LGNLE PGP KTAD P 
RTSHGCELRRSSYTYALTHGFSSTPETA 
RPAPLLP P P P P PTQRED YKE EDL YDDLL 
NK 


6385 


20286 


A 


6439 


163 


289 


KMYCYYYY/ C*DRVLLMLPRLVSNLWAQ 
EILPPWPPKVLRLQA 


6386 


20287 


A 


6440 


459 


431 


DKRIRTRITPLAIRKRQIKTTV/RKQLQ 
SHYM WVKETTNNK / CGI F IHS WWE CKM 
VQLLWKTVS QVFKRLDLP YNLATLLI F I 
CQL.Q I KTLiS S PKYLHMKVHS III KIRKK 
YI KKNLKCPSTDEWI IKI *C* S IHTMEN 
ERA*K 


6387 


20288 


A 


6441 


478 


36 


VFGCRVILVEKLFLCEPLRVTFLRLGKQ 
LE KSL / WNFHDLELN I S LHKNQ PNRCVH 
QWRIG* RKRS * CSE * *R/TMQFINSNLT 
L IN* I P S CHTAMD III I AQH/ 1 THTVEY 
YSAIKKNEIMCFPATWVELEAI ILGEIT 
QKQSQIWRSPP 


6388 


20289 


A 


6442 


464 


206 


LVQRSGIHLPYSSTVPL*GL/HQKNEAD 
VYRNTRMFTAALFLIVQNWK*HNYLQTG 
EKINKLPYIHTMEYFSVIKSNQHS IT 


6389 


20290 


A 


6443 


414 


216 


CYIYTCPLTQQSRSWVAT*ASSKEEWIN 
KL, WHIHTMD * Y/ S SNKKNKLQ INTTWTN 
LKSIMLTEISLT 


6390 


20291 


A 


6444 


7 


411 


S FYCLHS FAFSRVS YCWNIQ YIAFLDWL 
LSRSNSLHVFSW\WD/CKMV* PL/ YET V 
WQ FLTKLN I LLLHD S AJj TIiLG I Y * K/ DM 
KS YVHTKMCTQI F I AALFI IAKTWKNL * 
CSSVGEWRNKLWYLQTMEYSSSVLKK\N 
ELSSHEKTWRELLRERSQSKKAIYCIFQ 
QYDTLEKAKL.WRQ 


6391 


20292 


A 


6445 


257 


419 


KS LPFFF / CW * ETGSHS VTQVRVQWCNH 
IALQPQTPGLKQPSCLSLQGSWDYKFHF 
TQACPY*RP 


6392 


20293 


A 


6446 


20 


417 


HQYCFASFFFFNNFFKTESHSVAEAGAQ 
WHDLGSLKPLPLRFK*FSCLSLLSSWDY 
RCMPPHLDNF\ * FLVETGFHHVGQAGLE 
LLTSDD PPALAS * SAG I TGVSHRARP VE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ JD 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=GIycine, BHHistidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LFLLRRQGLRLLQNIGIGRQ 


6393 


20294 


A 


6447 


371 


3 


VIGCQFL/RKTKRYLPYDLPIVLLCIYP 
R/EKKNLCRH*TCTSMFMTALFIITQH* 
KYPKCHS I GEWLN I LWH IHTME Y YAEMK 
RNKLLIHEKTYMDLKDILFSGGQRVYLK 
SSILSIKCIRRQE 


6394 


20295 


A 


6448 


46 


332 


LGEHLRQGP 1 I LKVCQVILTHQQ YFE S Q 
W S IMAL S NRK YNANCR YNL KC PJNf S Y I KK 
EML/WPGAAAHACNLSTLGGQGRHIA*A 
QEFKTIAKPCLY 


6395 


20296 


A 


6449 


2 


381 


LGSLVFTHI CS I FLFFFFFFKDRALLP / 
H/PCWRAGP*SYLTAASNSG\VKQSSNL 
SRGKPLCRANFGSF\CFWFWKKGGPTMW 
PRLV*N\ ILLPQPPKVRGL*AAAHAGGQ 
GHDHTSLRPQTPGLKQSSNLSRGKPLCR 
AN FGS F W FL VLE KRG S HHVAQ AS LKQ S S 
CLSLPKLGDYRHEPSHRANMFNFRSLPG 
LRKADCLYPY 


6396 


20297 


A 


6450 


2 


136 


KS VGCWRGCG* I GAL/ LHCWWECEMVQL 
LQNAWQFLKNCKIVPHY*A/IPHLGIH 
PNALKTATQRNI CAPMFI AALCATAKRQ 
CK/PKCPSIDKT*YTHTMKYDSA*KKCW 
LLARVWINWSPVHCWWECEMVQLLQNAV 
WQFLKNCKIVPHY 


6397 


20298 


A 


6451 


314 


72 


MLPLTNGSCRTKVSKDIEGLSNSINPLP 
LIGIYRTPFPKTAGTPFFFFFLV/PNGT 
FTKIDHILGHKTRLNILKMI*ILKYIP 


6398 


20299 


A 


6452 


54 


350 


LRSFCSALFCFETVLLCHLGWSAVA*PR 
LPAGSSNPPPS/CLPIPSSWDHRLVPPH 
QVFLEAFFFFFFF/ CRKGVFFFLPGGV* 
NS GVKGT PR PNP P KALN 


6399 


20300 


A 


6453 


368 


172 


WLTSVI \QRFGRPRREGHLRPGA*DQPG 
LR*STHLRLPKYWDYRRESQCPARPVIL 
KEANMFHLG 


6400 


20301 


A 


6454 


1 


365 


NI LKLNNKLLNNL \ VKEEVAKE I KK*LV 
LNKNENTTYKDLWNEAKTVLHL*H*VKF 
I ALSA* R I VSNQ CS K* AS TLRNHKKKSN 
LNPRLI*QQKSITLKREKSMKPKAGSLI 
RSIKLIIRLT 


6401 


2.0302 


A 


6455 


364 


1 


PETRSCSVWPRLGGVQRQDHGSLPPLSP 
RLKQFSHLSLPSS*NHRGMPPCPANLLF 
KFYLFIYL/FGLFVEMGSHHVSQAGLKF 
LS SRDPS ALVSQSVATTGMSRRAWQDI I 
LQQKQAATAA 


6402 


20303 


A 


6456 


91 


300 


GHARHFGRPRWGDCLGPEVRDQPGQHGE 
TLTLQK\KNKN* PAWWHTPAVPATQEAG 
ITGVSHRARPCPPN 


6403 


20304 


A 


6457 


2 


195 


RFSCLSLPSSWDYRHEPPRLAIFFFF/C 
FWNFKKRGGFPLWARWVSNS * PRGIGLL 
WPP KGWGLRA 


6404 


20305 


A 


6458 


522 


202 


THNWRPSLQI FTAALFLLAKKWKQPKCP 
STDEMIHKMWYVHKMKY*SVIKRNEVLI 
HS A\KWMNFENMLNEEACWERPHI V* FH 
LYEMSRVGKTGETKSRLVW 


6405 


20306 


A 


6459 


377 


161 


EMNIKCPSTDK*VKKRY/LYIHTMEHCS 
ALKGNEVLI IHAMTWMNLANIMLSEKSQ 
SQNTTYSMITWKCLE 


6406 


20307 


A 


6460 


92 


484 


LLQACFLRT VL ID YKMWAEKGM VGMCVL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V—Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon,/=possibie 
nucleotide deletion, \~possibIe 
nucleotide insertion 














MRNMTVT 1 1 FRDRVLL * HPGWHAWRS Q 
LTSA/VKRSSCLKPPKC*DYRCEILCQV 
E I AFTHRS LFNS FLKRSMRVILLNYWKD 
PSLGFCSYDSTGMALLRNFS 


6407 


20308 


A 


6461 


483 


192 


MGLML I LRANFQS TKKKM I CLLKKS LTF 
LNCIVLSNWPCTFNMYIFVCVFAVYKLH 
VWAE \ P VAHACNPSTVGGQGERTA* AQE 
FETSLGNI IRPCLY 


6408 


20309 


A 


6462 


451 


98 


KQVWQLL * KQVWQLLE SLNMKLL YDLiAV 
LNLGAYPRE E KTH VHTKTCTQMFTAAP F 
I \ I KK* KKNFS IDMKYEERDKMWT I HVT 
QHYLIIK*SID\YAPTWMNLENIMLSED 
SQTQKTTHCIISFI 


6409 


20310 


A 


6463 


395 


67 


FLRFWCTHHWSS/DILYPVCSLLSLNRS 
HS FPRVHKS PL YHS YALCPHSLAPT YE * 
AHKMF / RLVNFLFLVETGFRHAAQASPE 
LPSASNPPFSASQSAGTTDASHRAQTK 


6410 


20311 


A 


6464 


485 


220 


FFFETWSHSVTQARVQWCNLGSL\KPLP 
P\GSSDPPTSAS * VAGIAGMSHAPTHEI 
PSIQARQQHLQGEKKIELLSLPTSASQV 
LRLLA 


6411 


20312 


A 


6465 


1 


371 


I S I P P KKYLR INLTT Y VQGL YEQNYKTD 
KRNQGRSSKWRDGPCSRIGKLNIRMSVIj 
SNL I YS FNVI LVI VPAS YFVS I T /RLI L 
KFT * KCSRPRI VPLFHLLLPLLVPFVCS 
FILCRPGASLVK 


6412 


20313 


A 


6466 


489 


175 


YKIIEISRCIK/GLL*LWRTTHLFSHPH 
WRRGMAYEKNADFRLGTVAHAFNPS tlg 

g*gr/wnqgfktslanwkpcly 


6413 


20314 


A 


6467 


2 


367 


fgtetlgqagi kgdpgekshwgeglhql 
realkilaervliletmiglygs*ltel 
paaagsagapgqtghppgshhla\epel 
gsgagpagtgtpsllrgkrgghatjstyri 

VAPRSRDERG 


6414 


20315 


A 


6468 


472 


260 


TTLGGAGNSGYRHIQPHVANYLFICRDR 
/SLILLPRLILNS*L*AILLSRPPKVLV 
LQT 


6415 


20316 


A 


6469 


88 


435 


RGVLLHMLSCLPLCKTCFCSSFTFHHH* 
TS FS fs FFFFLLGNKVS FYCQAEGQGPN 
LG*LKPLPPGLKQFSCFT/LPEKWE*QT 
EPPGPVKRHFGLSYFPLTRG 


6416 


20317 


A 


6470 


34 


106 


EFAIPFLTIYPIELNTHVHTKICI*MST 
AALSTR*KSPTCPSM\lNKMWYIHFMEY 
YS AI S KGRNHANVHQ W IHKM * Y IHT VE Y 
YS A I KSNE ALT YATTW / MNLEISTI ML S ER 
SQS*KICI 


6417 


20318 


A 


6471 


390 


92 


DFLIDAINHNLKVFFYS*FISMC/HPYN 
*SLSFSNIQKSLIKTMINVSLSFKY*YL 
IYFS * KKTF*N*RQGLTMLPRLLSNSWA 
QAILLPWPPKVLELQV 




20319 


A 


6472 


414 


167 


H J_ri± Kl (JA* 1 ii bAA* b ± iAKK* KPKCPT 
ADEWI NKMWYVHT I EYYSAI KK \NKLL I 
P WMNL EN I ML S TGNQTQ KAT Y CM I P F I 


6419 


20320 


A 


6473 


3 


244 


TTILLGTTYDALFPELYTTYFVTKTLLL 
TSLFL* \ IRTAYPRLRYDQLIHLL*KNF 
LPLTLALLI*HVSIPITISSIPPQT 


6420 


20321 


A 


6474 


12 


224 


IASGRPFFCYDVCV/VKRLCRHSIVIIM 
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NO: of 
nucleotide 
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M 
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od 


SEQID 
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nucleotide 
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ing to first 

amino acid 

residue of 
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sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GGALKKKPGGEPLFPH 


6421 


20322 


A 


6475 


417 


155 


FQS FTEJSTKMLS SSI TCS S E I FGERKS QLi 
M*RPSLIi/SY*EKLPLIASTTVISQRPS 
AL*VQ PL / PTAKNL * LTAGS KGHYYFLN 
SKVFSN 


6422 


20323 


A 


6476 


1 


131 


RTRPPTRTRTRGRWLTPVIPALREA/EE 
AGG*LEPRSSRPAWAI 


6423 


20324 


A 


6477 


830 


513 


WNSVKESLPABTWFLPFLGPLIDIFLLL 
XFALCLVNLLVKFVSSRLQ * FHVKMMLA 
QGFQPIPSSDPEDKDILPLSLLEQVS\G 
ILPLQC*AGSMPITSAGSSYRR 


6424 


20325 


A 


6478 


399 


196 


SKQSSHLSLLSR*KYRCAPPWWAFF/SF 
FLINI FCRDE/SLTMLPRLLLNS * PQVI 
LLPWPPKVFGLQA 


6425 


20326 


A 


6479 


1 


411 


NTGIMPANTVS ILIiLIDQEVLVTVKTCY 
IiRNTFCGFSDGSGHSV\QTFWIGSTILD 
AIKSL/R/DSREDGKISTLIGIRKK*IP 
ILMEDLVGFKTSQEEGTVDGLNLA*ELQ 
VEGPRDETELL* PQDQTCMDEKFLLMDE 


6426 


20327 


A 


6480 


381 


162 


AGFTHHNFFFLF*DMVLLCHPGYSAWQ 
SWLTVASNSW\VK*SSHLSFPKH*DYLQ 
AVHFRMASNSPGLYLLiYQ 


6427 


20328 


A 


6481 


379 


85 


S ILFFLFHPINFFFFFF*AWI FI I PPPH 
\RAFKWRWGFFLADSVWF IKRLPQYY 
FFFFFPGFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFLSVC 
GG 


6428 


20329 


A 


6482 


355 


117 


PKVKI FPPPPLPRKWDPRGP P * S PGTNF 
S I FFIi VEKDFPHVAQAGLKIiLDS S S TPS 
VAS\QSV* ITGMSPHAWLQVLLSL 


6429 


20330 


A 


6483 


402 


97 


T AFHLLD I YH KS * K I MFTNQ PLYKCPLF 
I IAKN* K*LKCPRRADKQIVA/WCIHTI 
E YYSARKRNVLLKYTRLWMNLKS IMLCK 
RSQSQDVTYYVIPFICHS 


6430 


20331 


A 


6484 


714 


365 


NGFALVAQAGV/ QWCDLGSLQRLP P IFK 
QVSCVTLiLiNS*DYRSMPPHPANFCIFSR 
DGV* S YWSEWSLTPDLGLPKCCD YMRE P 
P CPANAKS FTVEETHVT YQAYETMLLSD 
HDESG 


6431 


20332 


A 


6485 


408 


146 


LKTETQTETCP PMF I AALFTLV/ RRWNQ 
PQCLSTDEWISQI*YICIMNINSDIKRN 
EALKHAVI CMNLKNMVP FI S P TE KGKKK 
EKKK 


6432 


20333 


A 


6486 


243 


11 


GMSQYQE YS KSNFFLSNRVLLCHPGWSA 
VAH/SRLTEALNRFK*SSCLGLPKLWDY 
RRVPLLLAQNATR *NSL * S CPA 


6433 


20334 


A 


6487 


403 


161 


DMVSPRWPWSQNS \SLK*SAYLSLPRC 
WDYRREPS CQAS RNTS IGRGNRKNQWNT 
CM YLL F I L I PLKKYMCNGRVME YFKT 


6434 


20335 


A 


6488 


292 


24 


DWRGRMAI PLKKGS KC * QGCRNTGLLRY 
*W/MECEVAQPLQEMV*WFLQRLNIKLS 
YDPDF/PLLGIYPKE*KAGT*TDICMPM 
LVATLFSG 


6435 


20336 


A 


6489 


3 


203 


SSWDCRCIPPCLANFLICCRDR/SLTML 
LRLVLNS * TQAVLLPWPPKGMPGLLFVF 
YGGIVLSYVNIS 


6436 


20337 


A 


6490 


413 


30 


RGP/YQPGFLGFSPPNGLGFG*KVFIjPL 
P P P PRKG PGKKGS PRARGFPGQAP P KKE 
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nucleotide deletion, V=possible 
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NQRGGG FWS RGRLAKKKGF WFG * GPAKG 
KKFFFFLGAGVWGLKVLI P I KKKKGFP P 
PPLFFFFFFLQTIHPIRG 


6437 


20338 


A 


6491 


101 


335 


SSSFLGFPKYWDYRREPL/ WPS / LWQFF 
RLNIHLLYNPS I P I LAVYPREMKTCDL I 
KSCMRMFTVAVF\* ITQTWKQPNVF 


6438 


20339 


A 


6492 


532 


127 


STVFRARQLPCLHEPGRQSSP\PVPTSI 
LYT\PLPFPSSRYQQNGPCLSSPKLSPS 
PBLGTQSFPVHKSFSPCLPSPKPLTVPP 
ALPSTGPSQSL*QKHSIiASLLHTHSHTP 
PAPSCNPESWGLDATAAISTCFYG 


6439 


20340 


A 


6493 


399 


1 


LCPAPRGGVYRGR/ ESSLSWDGLHSVQA 
SWPLCLPTPASAMVDAPPPA\SLPSRSS 
ILDCCISS EQG S VGMG PAE PGAG * NLL V 
CCLLRPLEKCSIWAGVSRFSWYSLSQLP 
LARKGRS PNPLS FPNEKMPRPVS 


6440 


20341 


A 


6494 


319 


75 


KKIWGPLKKKFGLQTRRPKKWFFKRPPP 
F F FFF F F F FFF /HVAI FLL F CRDRG \ LA 
ML PRL I LNS WP * R I LQ F WL P KVP GLQA 


6441 


20342 


A 


6495 


333 


190 


LFFFFFFFFFFFFFFFFFFF*GFFFFIF 
YSFRDGVLLCHPGWSAAMPS 


6442 


20343 


A 


6496 


1 


299 


LPRLEYSGTITASNSWAPAILPPQLIK/ 
YNFFCCC*GRVLTVLPRLVLNSRPQIVL 
P P PP P KSAE I TGVNHHVWP KKI I SNEKT 
THCS C S ALL AKQ Q I S S 


6443 


20344 


A 


6497 


51 


266 


PVNVILPSSWNLRHSPPCPARFFVLFCL 
LFFGKN/RGLAMLLRLVSNSWP*VILPP 
QP PE * LGLQVQATMLS S 


6444 


20345 


A 


6498 


366 


78 


L LAHCHS SN3STKS LL I LDNFN I YH YGPSN 
TVTS*F\*MSSLMILLSSP/C*GTSWSY 
LFYALKKIFCKD/RGLAVLPSLV*NSWP 
QWLLSWPPKELGLQV 


6445 


20346 


A 


6499 


37 


396 


FPRPTRP KS S S PAKATQQNT I S FKKKKK 
SKQKKGNTVI IIRVKNQRKRKQKN* KLK 
Q*N*EQ*KINKISKPLAKLI/KKKRVKT 
*IANIRNERSSITT*STDIKRLIWKYY* 
QL 


6446 


20347 


A 


6500 


706 


394 


VPLLLPRLAVQWHNLGSLQPLPPGFKRF 
S CLSLLS S * DHRHTPPCPA/NS FVFLVE 
TRLHHAGQVCLKLLTSG/DTPSSASQSA 
GI KGMSHRTQPIE ILFYSVFF 


6447 


20348 


A 


6501 


403 


216 


ILNVLSFK/NMLY*HYTTVLRTVRPLSP 
R*ATVTCTYTSGWPEAAEDPQKMTFPHC 
DLFLPHPN 


6448 


20349 


A 


6502 ! 


377 


116 


LLPY*EN*FLMYLPPICCLPFDFGNSGF 
CHQKKNFFFFFKTKSLSVA\RLECGGAI 
PAHCKLCLLGSSDSMKNFLNEAEFPNLA 
FNGh 


6449 


20350 


A 


6503 


3 


401 


CVQHLRAS WS SRWAWAWRAPHSEQPAGP 
AAPGNEGLSIRASGCGGCTGYPSSASPP 
ALCSIS\PRP*LPSRRAGLATCSPPCLS 
LPPPPWAPVRPEPPRRVPPPAPWRPVPS 
TTQGLRSASARHRTGRQLHLQ 


6450 


20351 


A 


6504 


406 


55 


WPKGFTPPRFPRKEGSP\PPFPKVFILE 
TRQPGPQTKPPFFPPIFKLTRPGGRPP* 
FPPPQRVRAGKPP*PQSFRVPGAIFPPP 
PPPLKGHPPRPGFQKKKKKERKKRPTRK 
RGGT 
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Amino acid sequence (A~AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine> 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


6451 


20352 


A 


6505 


3 


281 


FFFCDRVLLCHPGWSAIWRVRSRLTATS 
/ SFLGS SDSHACLS SS WD YRHLPPLLAN 
FCI FLKRQGFTML PRP VSNP * AQAMRLP 
QPPKVLGLQA 


6452 


20353 


A 


6506 


1 


247 


LGSLQPP P PG \ SRHS PAS AS QGAGIADM 
S HRTRQ S I VF LRTNNKP Y * I ENMM I T I S 
CTTAYKYE I LRHS FNKI CARS LYEKL 


6453 


20354 


A 


6507 


34 


47 


FDPNLTPRHIRI ILWAWKERMLTASRK 
KKQ 1 1 * R S SNL S GDRLLNE E AS W E P * CK 
KS MQE T I QAT E GWND I F KVL KE KKNL \ H 
SKI L CPAKLS FK YE GE VKAFPHKPKWRE 
FTVTKPILQKC*RENQRTFDLTQI 


6454 


20355 


A 


6508 


403 


106 


HYVPPPSPPS/YFFFSPPGKRGFFPNPF 
ILVSPGFFPSPLF*KPPPEFFFFAPLKK 
IFFSPPRELKFFFFKGPPPFFFFFF 


6455 


20356 


A 


6509 


389 


230 


LSFFLFFRDKVSLCNPGWSAV/VTFELK 
RSSCPGLPK/SQSHCTKPQPGF* KYIYF 
SFFIK\KR*SLTTLPRLVSSSWDQVIHS 
ALWHPKVLGL*AVAHAYNPSTLGCQSAE 
WIT 


6456 


20357 


A 


6510 


101 


419 


ISFFFFFFFFLKKFFFFPPGWRGGGEF* 
LPQTPPFWVKKIFFPPPPKVGGFKNSPP 
PPFFFFFFKEKGFSPVCPGFFKTNPKK/ 
SPPPPPPRGVGLGGWPPPPGWDP 


6457 


20358 


A 


6511 


41 


376 


EWWCETRGGAGQGRAHGAAGGATGRVIiA 
I QARKRRPKREKHPKKVSCS VAQAGVQW 
RDLVLLQPLPPGLRVSLCHPS/WECGGV 
VSAH*NFCLPV/IKDGVLPCCAGWS*TP 
GL 


6458 


20359 


A 


6512 


2 


332 


WEKVKI ATLTGFWK* LI/SGSS RTLMGD 
FERFKTSVEKVIADWEIARELELEVEP 
ENVTELIiQSHYKTPGTIiAFSRLSPDVSL 
YSQLPETRSHEWQAASKIQPRKASTSQ 


6459 


20360 


A 


6513 


3 


294 


FFFSEMESLSVTQVGWWHNLGSLQPPP 
PEFKQFSCLS LP SS WD YRHAPPHIiA/NC 
FCIFNRDGVLPC*LGWVSNSHLGFTGMS 
HCTQPLIiALSFLHG 


6460 


20361 


A 


6514 


1 


526 


FRLGRKPR * GGVM * P VWSRGE PGS VGAE 
AG/RS*SAPRRLLHHPAAGLATGLSASG 
RRSARWKMERASGLSPGGGLGATSRQMS 
PGTQLANP PDHGDKDCLGRI S PGSGKQ I 
QAAGQLPGPPTSLAPAQGRLRSLTPWGL 
QTPEHSEPEGIGHLQAATEAVLPHSTQN 
LITKRNLM 


6461 


20362 


A 


6515 


747 


451 


QAGVQW*DLGSLQPPPPGFK*FSCLSLP 
SGWDYRYRPPNPANF\ * FLVEAGFHHFG 
QAGFELLTSGGPPASASQSAGITGMSHH 
T P LGVHAL VGEN I Q 


6462 


20363 


A 


6516 


444 


3 


PGGGGPFFPPPWGGKPGKIS * / PPGGGG 
SNKPKLPPCLPPGGPK/GNPPQKKKKKK 
GFGPGWAHACNPI I PALWEAEAARGLP 
PDAGRYGPGGRREAVRREAGPQECQDPE 
GQV* GD PRGCGAAHAPARGHPRKA* RGA 
EGSSPGLPTK 


6463 


20364 


A 


6517 


257 


575 


AS S VTWAGVRWCHLGSLQLLPLGFK * FS 
CLSFPSGWDY/ GHAPPGPAKFCIYSGDG 
VSL,C*PGWSQTPHLR 


6464 


20365 


A 


6518 


337 


389 


SQHFV* ISCLHDMPALASQSAGITGVSL 
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nucleotide deletion, V=possible 
nucleotide insertion 














RARL I F I FR \ MDS C YVP QAGLE / P P E LK 
RSSHCGLPKPWDYRHEPQAQPVCRFIRA 
TKCAYVS TQAL S PAP VQ PGMTP P AKRGL 
PCQPPGSPCPSLCHHQLAGTPPYE 


6465 


20366 


A 


6519 


61 


323 


FMWDRVARKGS I EGASLELPLGGSVGGH 
GS C I SMWGALGHS S \THV* HTHTHTHT 
HTHTQTNTHT P S S D S KKS S I LDKE E AE G 
CMTP 


6466 


20367 


A 


6520 


399 


101 


KIQIKATMKY/HNIPIRMGFFFKLSSQG 
/C**RCRK/IYTLKHCQ*KCETVWPLWK 
SVW /RVFLKLNIHPLHS PE I PLLGT YPR 
/DMKT * TCM * MFTATLFITTYS 


6467 


20368 


A 


6521 


1 


276 


RTRGFEGIiRFFLGFFGTVLLCCPGWSAV 
E\QSWLIAASASCLW\VK*SFCLSLLRG 
WDYRCIPPCLANF* *R/CGSWAQVILPP 
QSPKALGLQT 


6468 


20369 


A 


6522 


280 


277 


GDDPSCSMGGGHCIiASRSE*GSVGVGPS 
KPGAG\ YNLLVCYIjLRPLEKRS IRVGVS 
QFSRCHQSWLPFDRKGNSPTPCTYWVR 


6469 


20370 


A 


6523 


2 


327 


SLFS S * D/ YRHI PPHPGNFR I FCR/ SAL 
TMLFRLVSYSWPQVILLPQPPSVGTTST 
SHGPQPGIMFLKADLNPVRCQFRCALAP 
EPYIQDPFLRDDVAFPCLGASAPWHLA 


6470 


20371 


A 


6524 


298 


414 


PGPQSETLTKKKKGGGRLKGSNFTPAGM 
EGNI FF / CFFLSKSHSVAQARVQW/ P * W 
LTAGS LiQAP P PRFKLFS S KL * VAIYAS P 
TLLSS * DY R CTP PRLANF GIF PRDG VS P 
CRSCKAAVFNLLDTRDWFHGRQS FHRLV 
AGDGFRM 


6471 


20372 


A 


6525 


376 


180 


KCKKYPGGVAHACSPS*DYRHVPP/RPG 
YFLHF*QKHFGQAGLDLPTSGDPPASAS 
Q5 AG I TGVSHRTQPQLNYF 


6472 


20373 


A 


6526 


415 


86 


LYL P YDLiAI P FLGM Y PRG KKKCAHT K I C 
TRIFTWLFIIARK*KRSKNPSTREW/l 
KM WNI YTVESHS V I KRKELL. I HAS M I L V 
H* /WMI FENI ILSDRSQMQKITYCMVPF 
T 


6473 


20374 


A 


6527 


391 


166 


FFFFFRDWFSLSHPGWNAE\S *LTVISN 
SWSQARL.I FKFLKLRQGLNLLTRLVSNS 
RLQGILSSWPAKVLGLQA 


6474 


20375 


A 


6528 


544 


176 


TMRYLLTPVGMAI IKKAK\NT* C* *GGR 
E/lGTL/LQCR*ECKHjVLPIiWRTVWRFL 
KKLAIELPYAPPFLFSLEYGLFSLGSTN 
CAQLRS VYRKAVQNHQGVS GE CAGFQD S 
SHSAP VAI PTARQ 


6475 


20376 


A 


6529 


392 


139 


T YVKMHCK/ WLGMVAHAWNP S TLGGRGG 
Q IT * GQAS TKL SHS WGLSRE S RF WLE C Y 
SRS CS FCS CRLPADCTSLTDSKQ 


6476 


20377 


A 


6530 


362 


87 


NGPPFFFFFGEIDKLILILLWKCK/RPK 
I AKTVFKKKNKVGGL I *HYFKAYYKGTV 
lKLiVCJi 1 WHKvRKlDQ*SRRlibPE TELiHI 
YVLVNARDSA * 


6477 


20378 


A 


6531 


1 


346 


APTRPRLKKTLSKLKTTNLQLPYDPAFS 
FLSVYPREMKT*R\FAKICV*MFMASLF 
MIVTTQ/NVPRCPLMGEWLNKWR\*FIT 
* YHTGMKNT\ VMHKTT WMNS QR IMLNE K 
SQSLTY 


6478 


20379 


A 


6532 


3 


294 


FFFFRDRVFLYHPGWSAVV* S * LTAALS 



945 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
2o 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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PWLKRHSHLSLPGS*GHRMH/ PPHLGNF 
FV\FCIjFL*RQCLAiyiLPRQVLRSWPQVI 
FLPWP PKAVLGLQV 


6479 


20380 


A 


6533 


2 


280 


P R Y L VRNP T P T LGS CVT VD K * L S LWAS V 
LYIYIYTHTHTDIYRKIYAF/IFVCVYI 
YIFI * KHIYYKEQAHAIMDAEKFQDLPS 
ASWRPGKLVE 


6480 


20381 


A 


6534 


96 


183 


DQL1WFHRKLKRYDTYTFTVALLARHGG 
SC/LSLGSS*D*RYTPPLQTN*K*FFL* 
ISLPMLDWLVSNSWPQAILLPLPPKSSG 
LEA 


6481 


20382 


A 


6535 


2 


241 


GFKRIDSNFESSVGKVPTN/TIFCERKS 
LSMHQI/SLLSYFKKLPQLPQCSAATTL 
\ I S * QP YTLKQDPPPTRLQPAEDSDGY 


6482 


20383 


A 


6536 


485 


97 


PTTRPILLLGFYTK*LKS\NKSDSCPPM 
FIATLF/ILKCPSTDKWLKKMWYKHTTD 
YYSALKKGNHA/YM*QQMNLEGIMLSEV 
SQSHEGQLLHSTYVRYLK 


6483 


20384 


A 


6537 


411 


57 


LFPSHIKFVLSWQSCFAKTSGPQTPSFF 
FF*LFFF/RDRVSLOELSWSAV/VSWLT 
VASTSQAQACLSLPSSWDYRCPPPFPAN 
FSRRGFTILPRLVSNSRAQEICPPWPSK 
VLGLQRRGI 


6484 


20385 


A 


6538 


438 


201 


IYPKEMKT*VHTKTCI*MLIATLFTLAR 
KWKQTKCSSTDEWINKMWHIQ\TMEYYL 
AIKRNEAQLEHGEIIMLSERGQSQ 


6485 


20386 


A 


6539 


466 


131 


SFPKKGGYRAPPPPPPPFFFFFFFFVEA 
VLS L CCPGGS QTLGLICRS AHLVL.PKC * D 
HRHKP PCLASN/ S FFNRHRMMTNNCLMC 
IKSWCLCI INPFYHLDSSLLWMLSLI YR 


6486 


20387 


A 


6540 


419 


50 


PLPFFKPGPRIF/ CLGAPKKKI * VPT/ L 
REIKLFPLKGPPLFFFLNKGPLCFPGRK 
GRGVLFFLATPKSWGQKI FPPSKEGASF 
PFFFPFFSW*RWDLTMLPRLVSNSWPHV 
ILPSQPPRVLGLQV 


6487 


20388 


A 


6541 


1 


341 


S I IKTAREKCLVT Y* GTS IRLTGDF * KY 
I/SAKILQVRRQ*NDIFKVIK/EKNCQP 
RIQYPAKLSFINERKIKSFPNKQKLREF 
ITARPDLQEMIKGI LYLETKE * YLPS * K 

T_Tl\ f\ TVT T 

HMKV 


6488 


20389 


A 


6542 


409 


290 


KTCKQPTCPSTNEW/ IKL* YICITEYYSA 

T- T/TIMT^T C\T OT"T"T> 


6489 


20390 


A 


6543 


32 


397 


AMIVPLQASLGISTRSRKTLSLKKRKKKKG 
EKETLPGGST * KE KVRKKQNNLLi V* KTH 
IKPG\AGAHPGNPNIWGGGGRQITRSGN 
KNHPG*HGKTPFLQKKKKRGALLKGTNL 


6490 


20391 


A 


6544 


402 


0 


LSWTQ* CKPIAPATGEGGTGGSLEPRES 
RLWCAM I APVNA/NCTPAWVT 


6491 


20392 


A 


6545 


1 


365 


RKVTSLPSTSRCLS ILPHFSDGPRPVKP 

\ LAS STAVNQN* SRELGACVKWEHSGI K 
YGLF*RD/RGLTMLPRLVLNSWPQAILP 
GQPPKLLGLQA 


6492 


20393 


A 


6546 


1 


331 


I STFKS YLRNI FS KAIAAVDSNS TEGSG 
QG*LKTFWKGYTIIDAIKNIYDLWEEGK 
TSTLPGVWK\LDDFEG\FQTSVEEVTAH 
WQIAKELELEMDPEDVTKWLQSHDTN 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


6493 


20394 


A 


6547 


1 


417 


TRPKINKTKNLLFERINKIKKP*VKLTK 
NKNKNKT PHI TK I RNES E AFTI E KL KS I 
LrRE \ YMNNCVYNLL YANRKPRQMDKFLE 
RYKQLKLTQEETETLTD\YITSKEIKFV 
I * NFSTK* S * GPNGFNSE FGRVLWFTP V 


6494 


20395 


A 


6548 


410 


288 


AHFLFF C INK \ SLTMLPRL V * NM WP * VI 
FPPQLPKTLiGLQV 


6495 


20396 


A 


6549 


1 


216 


IDRGFTMLPQLVSNSL\LKRSSHLSLPK 
CWDCRCEPPCPACFHFLSECNTCIKNQL 
HVTLFFPNT*GLTSEG 


6496 


20397 


A 


6550 


479 


146 


VPGFKQFSCLSLPNS * KYQRAPPCPAWG 
FAFS VE TG F PHVGQAGF * F P T S GDP PAS 
AS QS AG I TGLSLHAWP \ENL * KLRLHAT 
GLRLHDHVPLKVPNYLKVCSFKFEIADV 


6497 


20398 


A 


6551 


42 


484 


S MR WNS MNVT I MPANTTS I PQ P TDQE VI 
SVFNS / YYYYYRYCKATAAIDSNLSGGS 
RQSKLKTFWKGLTI IYAIKNICDSWEEV 
KI ST * T I * KKVI P TFMGDFGGVKNS I K\ 
E I TAD VAE IAR*L* LE VEPRVGWLTP VI 
PALRKAEVG 


6498 


20399 


A 


6552 


304 


74 


HTGKAAKKKEC/WRGTVAHAYNPNTLGG 
*NGWIT*GQ*FETSLINMVKPRFYSCIC 
ISGYTHKVIHRI I I CKGKKLE 


6499 


20400 


A 


6553 


432 


62 


FWAGNSRMQQWCTDT/VSAHCSLDLLGS 
SNPPASAS * VARTTGARHHARLMLITMN 
EKPLGARYCARIFFENKLGIYPNESVPT 
KTNANWS KPRS CF S HDPD P GHW I YLM S 
SGLIVNKIGKVK 


6500 


20401 


A 


6554 


2 


174 


YCNLCLPGPSDLPTSAS*VAGTTCMCHY 
TWLIFFCI \ LVE TG F YHVAQAAKL F I Y S 
I 


6501 


20402 


A 


6555 


3 


268 


DAWADNI KCR*G YGAI * TL IHCWWECK \ 
W*ATWWFLIILNMLLPYNPVIMV\PRTC 
I Q I FMS TLF INTKT WKQPR CHLVGE WIN 
CGTCR 


6502 


20403 


A 


6556 


3 


402 


DAWTDNI QCR* RYGAI * TL IHCWRQC \ K 
W* ATWWFLI ILNMLLPYNPI IMDHK\TC 
IjQ I FMS TLF I DTKTWQ P PKCHLVGE W / 1 
KLWSMPAKEYHSWLARTEL/IVP*NT*R 
RIiS CT * LWEKSQ YE KATYCWP TV 


6503 


20404 


A 


6557 


396 


2 


KKQNAFQS PS FLPRL * TPKITI L I S FHN 
IYIYTHTYIYIHVCIFTYIHTHIYIFF/ 
RDR FLL CIiP GWRAWQS QIiTAAS TS QAQ 
GIPG*FFFK/SIFCRDW/SITML.PELiVS 
NSWAQAIPPPWPPKVLWDYRCGS 


6504 


20405 


A 


6558 


393 


175 


FKSQGLVLLPRLEFSGMLIAHCSLKLPG 
FSDSLNSA\SQ*LGL*GCTATPGPYr J LIi 
I YAGELGLNKRS ISSK 


6505 


20406 


A 


6559 


401 


188 


SPPEPFILGGPRVFPPPPFLKPPPRI*F 
WGPKKKKNF/ CPPPGPKMFFF*KGPPLF 
FFFFFFFFFFFFHRSS 


6506 


20407 


A 


6560 


37 


343 


YEPAWLLDIYPKELKT*AHRKTCTRIL 
IASPFKTVKMWQPPD/CP*ADE/WDK*T 
ACLQTVE * CSARKRS ELSTM/ IEMWRKL 
KCVALjSERGQSEQAACYVIPTL 


6507 


20408 


A 


6561 


792 


141 


LFPPPP/ YLSTSPNAGA* TTS PLPPQFL 
YHSPRRQQSGTKPPFFSKK/PEPPSP/S 
PNKFYPGPWWGVSPFKFPPPPQKS *GPK 
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Amino acid sequence (A=AIanine 
C— Cysteine, D— Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, HMHistidine, I=Isoleucine, 
K=Lysine, L=Leucine s M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KSPLPPEAP/PAPPLTQDCPPFVSNPGG 
KKKTPP/SAKKKKDPTPS/PSPPPRH* I 
GGPPGFSPPPLFKTPPPKLILGPPKKKK 
IFPPPGGKNFFFLKGPPPFFFFFFFFFF 
FSLLFPVGWSRWLRSEAADLCGECYSS 


6508 


20409 


A 


6562 


1 


356 


FFFNFIFIYLF/NFLRWSLALSPRLNGV 
QWSDLGSLQAPRPRLMPFSCLSLPSSRD 
HRRPPPRLANFLYF* *RWGFTMLARMVS 
I S * PQ / CDP P AS AS Q\ S TGI TGI CHGTR 
LIPRNPMIA 


6509 


20410 


A 


6563 


416 


84 


LSLPSSW/DYRRPPPCLANFLYF**RRG 
FTMVS IS* PHDRPTS AS QS AG I TGVSHH 
TQPI ISFLKSFFFTIiSNRFNRVCIDSEL 
LLQCKDFYLQKFNLHIAFQEYTHCHGVH 
Y 


6510 


20411 


A 


6564 


2 


51 


FWREMQ I KTVLRYFSL KL VK I KK* DS T 
FFWKG C / GETGNL I HKANfW/R/ SI* QYL 
KKLYVHFLFVPVI PPLEIYLKNKKILST 
K* LFAVF I VTE KC * KKKRCG * T 


6511 


20412 


A 


6565 


2 


511 


VRW/ S CAGL / LHTAV* I PGRI ITSEKCA 
QQVDETHRN/LQRLQPALINRKGPILLH 
NNT CL YTAKP MLQKLNE LGC E VLPHLS Y 
SPDLSPTNYFFNHLDNVLQGKHSHTQQN 
AENAF* EFVKF* SMDFYATGINKLIS /H 
WQKCIACNGSYFD 


6512 


20413 


A 


6566 


3 


248 


GD I GT \ SML VPALFT VP K/ TWNQPVS I S 
K*TDKMWYIYPVEYYLAIKKTKFC/SFS 
AT WM ILEDIMLS KP GTE R * I LHVL YL CG 
R 


6513 


20414 


A 


6567 


3 


258 


WRDHS SLHS * PFGLKWFSHLS PLSNWDY 
RDVPACPANFEKVTi*K*GLIMLPRL\KL 
SSFPEW* ILS SRLPASS FQS VG I THMNH 
GS 


6514 


20415 


A 


6568 


392 


166 


GGGGP PRGGGLE P PGPP P FFLKNKK* / R 
PPGGPGPLFPPPGGGREEKFFYLGGKNF 
H*TRVGPPPLSGGKKKKIRG 


6515 


20416 


A 


6569 


234 


411 


PGAVWHAYNPNTLGGQGHR I S * GQE FKT 
S/LGNIVRPHLY 


6516 


20417 


A 


6570 


1 


175 


FLQHV* NKFQTMS DQ I IGKIDDMNTL I D 
LEKNVKDL\LITQTEVEELEGElSrKIP IL 
PG 


6517 


20418 


A 


6571 


392 


140 


LPSSWDYRHVPPCPANFVLLVETRFLHV 
S QAGLE LP TSAS QS AGI TGVSHQAR P / E 
* *NS IRKGATT I QRDLGPVLQVRYLVRC 


6518 


20419 


A 


6572 


2 


228 


RTCTQMSVATLFLIAKNWKQPRCPSIGE 
WINKL*YILTMEYYLVIKK\NDFSSHKK 
TRGNPKCILLSKRSQSEKA 


6519 


20420 


A 


6573 


412 


100 


ALHSHPLGGPGQGVPKSRI PNPPCPQGG 
TPFFPKNPKLTRPGFARLPFPPL.KKLKP 
EKFPNPG/MPGVPLN* IPPPVSHPGVQP 
NFF S KKKKARGF C LiVLNP S S S 


6520 


20421 


A 


6574 


406 


257 


NKREKKTRKMLDSIVIRKRHIKSQMRCY 
SAPIQMAKV\SNRQSPKFWVRCRQTISL 
*HC* *ECKIVQSMW*TLQKFLPAIVFLC 
I YHTEKLRFTQQP / CT* IFIAALLMIV/ 
RTV/KRIiVCPSMGEWLSKIiW*VNIiSFSV 


6521 


20422 


A 


6575 


370 


68 


EQGS VGMRPSE PGPG YNLLVCR* LRPLE 
KRTI WAGVS * FSRCSLSRLPLARKGKS P 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














NP \ CTS * VRFSNGLS 


6522 


20423 


A 


6576 


1 


259 


IDLIDVLRPSLTVT*AWQWCNLGSL* P 
* P PG \ SGDP S AS AS RVAGT I GQHHHRWL 
IFVIFVEIWFRHILWFHTAIKNYLRLG 
NL 


6523 


20424 


A 


6577 


418 


218 


KGKTRLGRWPPCNPSTLG\GQGGWIT* 
GQAFRTSLANIVFPLM 


6524 


20425 


A 


6578 


I 


401 


EKAGCSGS YL * P * HFGRPRQE DHLTLG X 
*DQPGQCSENLSLQKNF/TNISWA*WPG 
IVAHACNPSTLGSQGRRIT 


6525 


20426 


A 


6579 


3 


356 


HWD YRHE PP FP V* MHFF I WLPGCHS LLA 
SPNLTGCFFS I S ITESLLSLFI YLETRV 
*DLQCSYFWERRG\SLCCPCWSRAPGLK 
RSCCLRLPSSWDYSRLTPPHLANYCILG 
GEGESS 


6526 


20427 


A 


6580 


411 


158 


IGSVAQARVQWRDL>SSSDPSAN*APPGL 
KQSSC\PASLPSRWDY\NKCTPPDLAMF 
YIFFVETGFPYVARLADCFYPPPL*KMT 
GSY* VFTMCQI PH* AFYKF* FI * FRVGE 
DKNSLPAWQHRETLSLQKKYKN 


6527 


20428 


A 


6581 


399 


218 


GGAELKVPK*GGGPGSPPLPPPLQKPRP 
KGPLWPGG*GPPRPPGEAPSPLKNKN* / 
PLPG/AGPPPVPPNWGGGFYPRG*GFG* 
LPLGP * TLARGKQRDPVFKKKKKN*NVN 
KKKAKL * KS * Q FFFFFLKTGSRGFPRAK 
VQGPNGSYPKPQPRG 


6528 


20429 


A 


6582 


392 


214 


GSSDPPNSTSRVAGTTDEVLPCSPGWS * 
NSW/VSK* S ACLGL P KC WD YTRE P QHLA 
RSNF 


6529 


20430 


A 


6583 


390 


257 


PKRPPPPAFFC*PK\SLGMLPRMISNSW 
AQAI I *RWPPKVLEKQA 


6530 


20431 


A 


6584 


65 


251 


HYFNS FNFFLMGS SNHPTKNPG* FFFFL 
TFYKNK/SLTILPRLVLNSWAQGILLPW 
PPKGLGL*AQGILLPWPPKGLGL 


6531 


20432 


A 


6585 


2 


143 


RHMP PCLSSF/SYFLi* RRG FAVL P RL F S 
KSWVQVI IiMPWPP KVLGLQA 


6532 


20433 


A 


6586 


142 


334 


GRWLSQLLILNFTTRSVLKQMSW/MACV 
SETWPA/STLGGRGGRIT*GLKEFKTSL 
QNMARPYLYK 


6533 


20434 


A 


6587 


485 


377 


RTRLSCRFFILSLIFRFGNFYCFTLKF/ 
H**VFLLSFVFYC*GHP/FEFIFKVFFI 
I FFHFKVFI CVFKYLLFFF IL I AETFYF 
FVNI FS FKKFVSVMFVLLIEMF YYGRFT 
IPLR*F 


6534 


20435 


A 


6588 


241 


251 


NSLFTKKKKKKGGALLKEPILRAGGGEG 
IIFFKGPQKSIPGPGFKKAGREKPGPSP 
*NWFF*KGPPFFFFFFCKE/RSFTMLPR 
PASNSWAQAICLPQPPKWGL 


6535 


20436 


A 


6589 


429 


54 


YFLFLPSLPTFFPSFSSFLPSVLSFFLP 
SI/SSFLPSLLFYILSSPPLSSPPSSLF 
LSFFLSRVSFCHPG*SAVAQSRLTATS I 
SWAQAI LP / CR/NRDFTMLS RLALNS * P 
QVIRP PR PP KVLGLQA 


6536 


20437 


A 


6590 


411 


137 


KCIGTITTHCSLHLLGPSDPPTSASRW 
GLQAHHHIQLAC\F*LID*LID*LUVEM 
E S RHVAQAGLELLGS S D P PTLAS QNAE I 
TEMSHCIF 


6537 


20438 


A 


6591 


412 


196 


QPQPPGAKKTSGGGFPIR/WGHQPPPPG 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H^Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginme, S=Senne, 
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W=Tryptophan 5 Y=Tyrosine, 
X—Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AKKYNFFFFFFW*RQALAMLPRWPNCQ 
T* AI ILPWPPKVL * LQA 


6538 


20439 


A 


6592 


28 


400 


IMPNCFPKWLYITFFFFFF*RQSL/DSV 
AQAGVQ *HSLGS TATSAFLGFK* FSCPP 
AFLE * LDYRHVPTHPTNFCVFNKD/MGF 
TMLARLVSNS * PQ / CNP PAS AS Q SAG I T 
GMSHRAQPHLFVLKMEKPPPNFLFILLM 
NQSIHIYLLNFYRF 


6539 


20440 


A 


6593 


414 


23 


QTHLFWFPPGFSPPPFFKNRPRIFFLGA 
LKKNFFFPPPRHKNLFFLKAPPFFFFFK 
RSFFPPGWCAIGLFSLQPPPPRFK*FFC 
PSLPKKWGYRPLPPRPGKF/ cfffffvi 
LiVENGFHHVGQDGLDLL I S 


6540 


20441 


A 


6594 


91 


274 


ILHWS * PLGNYHL/ WLGWAHARDLSTL 
GGRGGR I AGGQDFKTSLGS KQGS I S ARK 
KEKKKK 


6541 


20442 


A 


6595 


391 


251 


HLKRCSTSSVIREVQIKTTTTCHFSE*K 
R/SEINMCW*ECGQTELSALLIRC*\WD 
SEMVQPIjWERKEYLKQISTESSWAPAIP 
LLDI*P*ERCASIjSLRKGWGYRPEPPEL 
ASFLLGSYI 


6542 


20443 


A 


6596 


465 


268 


ILLCHLGCSAVAQD\LTLCSLKRSSHLT 
SPSSWDSRRGPSHLTKFF/C* *RGLNTL 
PSLV* NFWAQAL 


6543 


20444 


A 


6597 


3 


229 


CHGSPQPCPPGLK*SSHLNLPTPK*L\D 
YRGMPPRSANFLFL*RWGRCVAQAGLEL 
PDSSSLPASASCPRQPVLYF 


6544 


20445 


A 


6598 


122 


418 


I IWELCHIMCFFFFFFFKQNLLGTQAG 
GQGGKNNG*LKPKPPRLKGSPWLTLPGG 
WNYGGFPP/HPS*FFFFFLGKKGPYFCP 
PAGGKFFNLKGPPFFA 


6545 


20446 


A 


6599 


1238 


930 


FETDSCS FA* AGVQWRNLS SLQTPPP \G 
SNNS CAS AS / R I AG I TGTHHHARVI FAL 
LVETGFHHIGQGWSRTPGPQAIRPPQPP 
KVLG I TGVTSRARAI S KHFV 


6546 


20447 


A 


6600 


398 


78 


PSSPFYLFIFRQSFPFFARVEEKGGHPG 
FPNPPPSGLKKFFYAPAPPGSKAPGQGP 
PA/RFYLFIFRQSFPF/ CCPGGGKRG1PS 
GFPQPPPFGVKKIFLCPSPPRK*GPRAG 
PPRPRKIFFFFFSSLFFFFFVRWGFTML 
PRLVSNSGAQAI CP PQPPKVLGLQV 


6547 

* 


20448 
- 


A 


6601 


3 


473 


FPKGKKKGPPPPAGG/QLKNPSKKPNKK 
PGAP P RG P RGP KKGGGGLG S P P Q S P QGG 
EKWTLKGGPNLPPPRGGLGGARPPPKIG 
GGPLFPKGGGPKGGTPPPPLFFFFLRRS 
PALSPRLECSGAI*AQTSS/SPPARVGC 
P S P P AP TR P TGLT VS PGSN 


6548 


20449 


A 


6602 


398 


156 


NGRKHTDVKPEELPSWVPTRDLTPTGTA 
GAF / Q / RGYYGY YNKYVTVKKGS I AGVS 
TVLAA*VLFSYCLSYREIKHEQLLRAH 


6549 


20450 


A 


6603 


380 


77 


PI? PK.KtiJJpij^JNif PPr t* *K_KKVJb 3 PPPPGK 
LPLSFLFFFFFF/ RDRVLLCRPGW S AMV 
HGHGLSPVNFCISTRNRAL/SMLPRPIS 
NTWTQAIRPS*PPKVLGLQM 


6550 


20451 


A 


6604 


352 


204 


KRGPKI * NF FFG * RQ S LHML P RWPNSW 
AQMILPPWPP\KVQGLQA 


6551 


20452 


A 


6605 


631 


357 


FFF*DRVLLYHPGWSAAAR\SLLTTTSA 
SW\KQFSCLSLPSS*DYRCAPPCPASF 
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K=Lysine, L=Leucine, M=Methionine, 
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CIFSRDRGFTMLARMVSNS * PQVIRPPG 
PPKVLGLQA 


6552 


20453 


A 


6606 


3 


397 


VLHKLTQRCS AVPLRAPTAFLE I GTQTP 
KRVWKCR * P /RETQNNLLDKEQD FKTYQ 
KATVTKAWCQHEDTIEQWDGIESTEKK 
FHTQRAGGRDHSHSDSVRKKRKESLFFF 

Li krgl i lf pnle gtrgg I s t P 


6553 


20454 


A 


6607 


323 


35 


YLKTKIGKPGGLAALQFLGFRPSKNTQL 
GNRFMCTHTHTHTHTHVHPHTSQYLYPH 
RSISFCSYLDSl/SP*VLTESSNSNPTP 
LKCC*HFPSPCL 


6554 


20455 


A 


6608 


439 


252 


LVPPLFFPRSFLLFSPPKKRGGPKPV*F 
GPPRAFYPPPF/V*RGPPPFFFGAPKKK 
IFFSPPPG*KCFFF*SPPPG 


6555 


20456 


A 


6609 


3243 


2602 


FFFGRDGVLLCCPGWSQTPRLKQSSSLS 
L\PKC*DDWH/RATSSGLQVFLStCCSI 
ISIFFMPSICDKSTKQFSN/PQRAYILV 
G * R * V I YLG WLAFL F VFLRQHLi / NT VA 

qagvqwcnlsll* ppparfkrfsclilp 
rswdhrrlpprpanffvflvetgf\til 

ARLVSNS * PRDPPASASLS AGIMGVNHR 
TWPILVFLHPSVFFKVLQLFYL 


6556 


20457 


A 


6610 


81 


395 


T WAALKL YS TC1FSSS ETEPDVSQSQ I T 
DTTACIMEPGLFW/YRSIIWTL*FFHFF 
WRDGGLTLLTRLVWNSWAQVISLPGPPK 
VLGLQAS ATAPNS F I LIREG I S 


6557 


20458 


A 


6611 


1 


126 


GRLRQENHLNPGGRGCSEPRLHHCSPAW 
AIERDSVSKK/NKNGLKRCFHLRKKTLK 
GE KR I W * P GGRGCS E PRLHH CS PAWAI 
ERDS VS KKKKMD 


6558 


20459 


A 


6612 


389 


67 


YL FMYLF * / DRVLL CHPGWSAVAQS * LT 
AASNSWPQVILLSNRFKSWTVLLLSSKI 
PWQGSTPCPRCYSWHPKCLQLQGLLGL 
DCSSPKYPRGPSLAAFRSLPKWHV 


6559 


20460 


A 


6613 


1 


384 


RCFKVKDLKMRPFWNIQVDPKSNDKCPC 
KRLKRRGHASLPPWEKRRRPGKDGGRGW 
NSAVTSQGT/VEATRSWKSQGSVLP*SL 
* RERS P V\ EIiAHF * QRLAMLPRL VLNS W 
PQDISPPWPPKALGLQEP 


6560 


20461 


A 


6614 


385 


174 


CLRLSALKKP KLNS WY* K* LYIMEY* SA 
LKT\NKG I LS FAI T WMNLDD I LLNE I S Q 
TQKDKYHMNSLICEI 


6561 


20462 


A 


6615 


1 


370 


MNPRAHLS FLLRRKERR I LWLKALGH I M 
/ FTV* GLTG I ALANS SLYI VLHDT Y YGV 
SHFHYVLS IG&GFAI IGGFIH* FPLFSG 
YTKKKKKN I RGGGT KNFLL C PALVQRE K 
VNPFFALTNLWG 


6562 


20463 


A 


6616 


3 


342 


YRHE P P H S A I FAQL YKL * K * L I YALKTG 
KVYVYKL YLNKA I KS Q \ S YNRAS L S LT * 
* PRVKRKRS * PGTGAHTCNPSTLGSQDG 
L I A* VQE FETlSfLGN I AKP CLHQKKKKGA 
PV 


6563 


20464 


A 


6617 


1179 


915 


F F F E T E S CLVTQAGVQ WRDLGS LQ S P P S 
G\ S SD FQL I F VLLVHGQAGLELL I S * S F 
ALAS QTAR I TGVSHHAQLVW S D F I KS IN 
LFCCL 


6564 


20465 


A 


6618 


3 


390 


HASVKMELPYDAAIPLPSIYPKELKARF 
*DLCIPM/ FITALFTXAKRWRQLNCPSV 
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OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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VIRS WGREGVKES CLMGME FHSAR* KS S 
GDTFHNTENCY*TVHLKIVSGKF**L*C 
VLNYVGFFYHNKN I KRLS 


6565 


20466 


A 


6619 


392 


49 


QFLFERLFSSYSGYVSGVKMLGSTEVQL 
FPRKCEMKSCSVMPRLECSGGISAHCNL 
CLPGS RD CPAS AS * VAGL I FY /I FLVEM 
GFHHDG / RELVLNS * P QV I R PHQPP KVL 
VLHV 


6566 . 


20467 


A 


6620 


193 


282 


TPGLK*SSCLSLQNSWDCIWDYRRLPLY 
P VNCFCC * FVCF VC / 1 FVE PGS C I AQYC 
LELLASSNSPMSASQSTGI IGMSHGACL 
ILT 


6567 


20468 


A 


6621 


383 


8 


LNDTVSTFFF*DRISL\CPSWSAVARSQ 
LKAASTSWVQAILPPQPKTNLLGSSDSS 
TSA* DQS PGFK* S FCFRLPGS WDYGCVP 
PFSANF*IFCRDKLSL/WAQAILSPWPP 
KVLGLQALWEAEEGG 


6568 


20469 


A 


6622 


395 


42 


HVSSTSFQGDELHLRIVALPLPVPLHNL 
PLPLSPRSL*C/HSRGWTVLPPSRLTAT 
SLPD/SSCLSLLSAWDCRRAPPHLSGFC 
WSAVA* SW\SLQPPPPSRLPWPPKVRRL 
QPLPSRHPV 


6569 


20470 


A 


6623 


3 


327 


PS P P S RWNP KD VP P NQENLG I F FF F L * T 
RDLATLPRLVPNS * TQVI /WPPKVLGLQ 
A 


6570 


20471 


A 


6624 


395 


47 


KNRALFFFPPPRKRGFFPKPLXWGTPRF 
FPPPPF*NPPPYFFFGAPKKKFFFS/SP 
PGGKIGSFKRPPPFFFFFKQNFFPVPPG 
GGPRGDFGSLQHP PPRVKGFS CLAP 1 1 F 
FSSF 


6571 


20472 


A 


6625 


100 


267 


LSLSKKANISKFWLGT\VAHAV\DPSTL 
GA*GRWIT*VQGFETSLGKIVRPCLYFC 


6572 


20473 


A 


6626 


409 


126 


PSSSFSSSSSSSSPSSSSSSP/PFFFFF 
FFFSFW*KWGVSLCGPGWSQTLGLKQSS 
CLS LP KC WDYRNKPLHLAFLAS L I LFT F 
QEWDITD 


6573 


20474 


A 


6627 


506 


323 


REAPGTLIPCWQKYDMIKAQLVWQFLRK 
PNIYLPYNPAIPL*HTCPRKINTQYT/C 
FITKTCTRMFIAALFIIPQNWKQPT*PF 
CPFTIISHSCSPRDNWF 


6574 


20475 


A 


6628 


330 


117 


DVCPRKI YTLVLQQLCI \RMFTAILFS I 
AL* FCKPMCTS ILEWIKKL* S IRTTEYH 
LVMRRKFKPLLYSMPQM 


6575 


20476 


A 


6629 


396 


139 


FFSPPEKKGVFPSP*NLGTPGFFPPPPF 
KKRPP *FI FGAPKKKK/HI S PPPAEKFF 
FFKRPPPLFFFFFFFFFLFFFFFSAFKL 
KV 


6576 


20477 


A 


6630 


79 


405 


VSGRMTALESYLLASSA\RNKCSSNCKI 
KKMTARLDVLAHRCNPSTFGG*GGRIC* 
AHKFETSQGHIIRPCLYKKKKKGRPFKR 
TQFS PARWQGNGFFWGTLKNSRAGF 


6577 


20478 


A 


6631 


2 


417 


S FI FPCKFLNQFI S FYKNVCHDFDCNCV 
E S KDQLG / WPDAVAHTCNHNTLGS QGGW 
IA*AQELKTNLDNMAKPCLY 


6578 


20479 


A 


6632 


74 


417 


KTVFRYNGWAL * AS ENR I S LADLLLPS F 
HVLLCLPKAG\PNFPLPSS IKKFFDLPD 
LIRSHKTHISEGVLFPFP/RRKEPRTER 
PRRI * TDRS CWVS SLS LI VLDHAFFVQS 
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QFYM 


6579 


20480 


A 


6633 


740 


776 


CSFFLKKQTRSRAAVQAGVQWCHYSSLQ 
PSPPDLKQPPASASQTV/KD*GAYFWCC 
FFLFLKRWGL/NSWQAGVALLPDLPW 
RRLVIPIjSLWAVEPALWAAAQIj*LL/ sp 
CPLASQSPRHQTGQSPHQKCFPDILGLH 
LALRSSFSPGLERPTLRLPPGLASFHT* 
ATWLSWPLPYK/PPKRVNMYLVKADRPE 
PFL*GTRGRKSVMQMTHAKTLQETLYSG 
PEAQAAASVTDEF 


6580 


20481 


A 


6634 


2 


296 


FF FLRQS FALWQAGVQWHDLS SLQ P P P 
PRFK*FSCLSLPSGWD*RHAPPRPAIFV 
FLVEMGF\TMLATMV\* I S * PGDLPTSA 
CQSAGITGLSYRARP 


6581 


20482 


A 


6635 


407 


212 


GRPR*TPPPLGKKKKGAPPPKTKKKKNF 
LGGRDSYSQKKPFFF*RNKTRPGGGGPP 
GIPPPLKG*AKKKGLPRGPKFPVT* /NK 
PPPPPPGGNNQNP FPKKKKRKSTR* I * K 
IRISRFILYFFFFFFLERGFGCSPRGGG 
GGAFLSYGKLWPPG 


6582 


20483 


A 


6636 


20 


223 


PCKKLPQPLQPSAT\TLISQQPLTLRKD 
LLPGKRIi^LiAEGSDDSYQFLVAKYFKIK 
GLSFGRNKLANI 


6583 


20484 


A 


6637 


866 


647 


FFLDRVIi/ LCHLGWSAVMRS WLTAALTS 
GLKQS S CLSL PKLRDYRHEPQHQANNI S 
* S S WSDSNMQPRLKTTL 


6584 


20485 


A 


6638 


398 


26 


KGAPPPNIFFFFLFNIFFFFFFFFFFFF 
FFP INFIiQCLLYYRALHFNKVKVINNF / 
PLNFCAY YHPL *NM*FSYIS FQNLYKF F 
Ll/FGVFNLSGIYFNVL*IANF**G*IiI 
LPGTIYIiKLCYFTTN 


6585 


20486 


A 


6639 


1 


341 


NWDYRHALPCSANFFILVETGFCQLPRL 
VLNSCPQVSHHA\KFLFIFFFLFFLEEG 
GFVGPPRLEYREGSIjQP^LPKLKGSTCLi 
TLPKG*N*RGEPLPPARSSFFFISKFFI 
ET 


6586 


20487 


A 


6640 


391 


118 


NANFEKS FSMGKI L *NSLAW YKKILCD / 
RKNR SMQ KTP LD * * Y YFKKL P Q P L * P S A 
NTTLIGRPALTSRQGLPPAKTL*FAEGF 
DPMILSLF 


6587 


20488 


A 


6641 


352 


285 


KKKKKGGGGGAYSG I PTF / CFFFLNGVF 
LGPAGVQWQNQRSLQPPPPGVKLFFFLS 
IiPSH*GPRHGPPRPGMFIYKNF/CFF/V 
RDGVLLH C PGWS Q T PGLK 


6588 


20489 


A 


6642 


433 


64 


TGNFFLD S LF YFNPGPLGGGG ILKGFFS 
FFFFLETKFFFFPPAGGQWRYFSSL*PP 
PPKFKQFFFLSM/WPRFGKFFFFFFGIIi 
VE AGFHRVAQAGL KLiL S S GNL P T S AS Q S 
ARITAKSHRTVF 


6589 


20490 


A 


6643 


424 


11 


PPSIFLAPPPSRGGGPPPPPKKK/QSPP 
KKKKKGGGGGP KNP P PKKKGP PKKTTGF 
KKNP P QKKKKKNS P P KKKGGP PANK * KP 
PPRPQPFYFLTFYYKIFIYIFLLKFLLF 
YILFFFFLKSSKQGRPLAI 


6590 


20491 


A 


6644 


411 


2 


PPLRIWLASPPSKI F* PQPLHI / YFSPF 
PP*KFFFSLKPLIFLGGFFPKFPPPKKS 
FFPKNPQGVF*NPP * KKKI YFS S PP*NF 
APPKIFLKTPPPFFFFFFFFFVEMGSRY 
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IAQARTRERTRGRTRGRTRG 


6591 


20492 


A 


6645 


34 


381 


AKLGTRGAAKQEKEIKVTRI*WKEIELS 
QFTE YTQKNL /NVTDKXiL* LVYMFSKI T 
G YKVNLKN INC I FHKHTNQQQKI LKMP F 
I IA*K/TIK*L* INLTKDTQDLYAEIYQ 
TLKAEL 


6592 


20493 


A 


6646 


17 


448 


HFYAKLGTRASLSCSGLHPV*ASQPLYL 
PTQASAMADAPFPARLPPNSLISYCCTS 
NE QGS VGVGPAE PGAG YNL V * HLLRPLE 
KHSI*VAVSRFSHYSL,SRLPLARKG\NP 
Q T P CAS W VRQ C P T LLRLALHGLHP LiSNQ 
SQGD 


6593 


20494 


A 


6647 


462 


336 


S LRL P GSND S PAS AS * VA \ G I TG P CHHA 
RLVFVFLVEMEGLGS 


6594 


20495 


A 


6648 


447 


62 


KDDIWIAITHMNRWS ISLAKRKM* IKIT 
MKYPYTS I KMA/ 1 KQTNKN * KKS S S I KC 
W*ECRTMRTLIHCW**HITG*PLWKTVW 
KF I I KLNL YLP YD P A\ TCTQMFTA/ AL F 
1 1 IKY* KQSKYPSSGYCVPA 


6595 


20496 


A 


6649 


3 


1492 


LAYARAFLDDTNTKRYPDNKVKLLKEVF 
RQFVEACGQALAVNERLIKEDQLEYQEE 
MKANYREMAKELSEIMHEQI/CPPGGED 
ERLTE FP SHLQRHQ WDSNKHNGS RDDQL 
VFGRVITSHGPCVGTCFVICKLRMLSKA 
NHWGDRAQ / ERTKGKGRERK*RTTLFLN 
R/RFYRSWKKVHIFF 


6596 


20497 


A 


6650 


495 


170 


GAAEFPASQSAGIAGVSHHARLEEFFSL 
D S GRVMRGT E VLKYLGG YT I T I * S I FFF 
FSFF\WPGLGSLQPLLPGFK\LSLLSRW 
DYRCPAPHLISL 


6597 


20498 


A 


6651 


9 


222 


QTTRPERRNSGDPTHLSL* SSWDYGHVC 
HHAWLIF*K/CFSGGLALLPRLV*NSWA 
QAVLPPWPLKVLGLQV 


6598 


20499 


A 


6652 


241 


833 


KKTPIiAMWNPWPGP PQIPHPLTKLiVQMA 
G / QENGCLSGHDLE S I RP AGLHNS ARGE 
VLGLSSSLGKELVFLQEELDLSEIHIPE 
AQEVEMASGHFAFLHVPVPDGRAPYCKA 
SLSASSSLEPTPPEDTAISSLRPPSAPE 
MLTQHGAQEQVEDHPG/Q*QPSPHSQSR 
PS PQKDPQALVIASLRSRTERKQASHGG 
TRPG 


6599 


20500 


A 


6653 


1 


398 


FFFDIGSHSLiTQAAVQ*HDHGHGSLHPR 
LPGFK* SSHLSFPES * DYSRHTPPRPAN 
F/ EVETEFLHVPQAGLMRCNLLPAT/AT 
SKSARITSVSHHVQPEVSIKW*NNTSNL 
I KCFL L F YKYKC I F I NT KF VF TV 


6600 


20501 


A 


6654 


489 


282 


PAS/WDY/RHMPPLIANACAFLC*RDFS 
MLPGLVSNWWARVICLPWPPRVLRIQAR 
ATTPG 


6601 


20502 


A 


6655 


953 


908 


SQSCSVS\RLECSGAISADCNLCLLGSS 
YS STSAS * VAGTTGARHHTQLI FVFLVE 
MGFHHVGQPGLELLT\ *VIHLSRSPKVL 
GLi * AP S S ARGQNS VS Q 


6602 


20503 


A 


6656 


412 


44 


R I S ERRKS LGAVHTKDKKYS S SRVRTSN 
PNCARSGAS SKQS FLFGKKRAPHPG I KS 
F \ KDALL WDGITGVHHHA* Q I F VRSRE 
R* IRPVIPALWEAEAGGSRAQKIETILA 
NTVKPRPSLLRA 
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G=Glycine, H=Histidine, I=Isoleucine, 
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nucleotide insertion 


6603 


20504 


A 


6657 


3 


241 


FFIFLEARSHSVIQVGMQWCDHSSL*TQ 
TPGLKES \ PASSFHRAGITGVSHHTYLF 
L*GH* SHSQGPHPHDLITSKRPCS 


6604 


20505 


A 


6658 


467 


42 


FFFFFFKQGFGLFPQLEGKEGIPGPCNP 
CPPGANDSPPSTPLGNGGPGGPPPNPGN 
FGIFGI*GGEGGF*TPGSGKPSGPGSPK 
\IPFCWVIRERPHGPPPRPFFFFLRQSL 
CFAQAGVQWCDLGPLQPPPPRFKQFCLR 
NS 


6605 


20506 


A 


6659 


2 


286 


knlkinwt*wlmpwsat/wggslkpgr 
lrlq*aviaplhsslgdrarpylsvsqy 
lyfslslslslslththththththkie 
llgcvaeiihff 


6606 


20507 


A 


6660 


3 


249 


LSPRP * CGDT I S AHCNLHLLGS S SSPAS 
VSQVAGITGACHHAWLIFVFSVEMGFHH 
AGQD CLHLL/NL V I HPPQ P P KVLGLQA 


6607 


20508 


A 


6661 


3 


1507 


PEAP VP FLDSNQENDLLWE EKF PERTTV 
TELPQTSHVSFSEPDIPSSKSTELPVDW 
SIKTRLLFTSSQPFTWADHLKAQEEAQG 
Ij VQHCRATKVTL PKS IQDPKLS S ELRCT 
FQQSLIYWLHPALSWLPLFPRIGADRKM 
AGKTS P WSNDATLQHVLMS DWSVS FTSL 
YNLLKTKLCPYF YVCTYQ FTVLFRAAGL 
AGSDL I TAL I S PTTRGLRE AMRNEG I E F 
SLPLIKESGHKKETASGTSLGYGEYVIK 
ITL/SSSTDLWTHEIDLHRNSL*NRDSN 
CSNF/LREQAXSDEDEEESFSWLEEMGV 
QDKI KKPD I LSI KLRKEKHE VQMDHRPE 
S WLVKG INTFTLLNFLINS KSLVATSG 
PQAGLPPTLLSPVAFRGATMQMLKARSV 
NVKTQALSGYRDQFSLEITGPIMPHSLH 
SLTMLLKSSQSGSFSAVLYPHEPTAVFN 
I CLQMDKVLDME WHKELTNCGLHPNTL 
EQLS Q I PLLGKS S LRNWLRDY I YNWR S 


6608 


20509 


A 


6662 


326 


1 


IMIFLFFQTGSRFVA\RLECSGTITTHC 
SLNLPGSSNP\PTSAS*WTHCRTRVIL 
NFSVDPGSRHWKGGLELLDAILKPWPP 
KVLRTRGSTPEFRSGGSGVRIRGFN 


6609 


20510 


A 


6663 


414 


105 


QGKVGFKVGLNPLSRLLHFFKQKLFVLS 
*KGQVKKLSLKIPFPPGGFFSPPI*NPP 
P * NKPQRFKMPFMGFT / IRKKI L FLAFK 
STKSGG*NPPSK/HKKI*FSF/ CFWETE 
SRSVTQAGVQWHYLSSLQPLPPRFK*FS 
FCKPHKRVFKPLGLVLWGG I SNRRRKKT 
SGGKRNFKA 


6610 


20511 


A 


6664 


1 


407 


GCVPWPAAREAEAG* IA*TRET/RLRW 
AGI VPLHS S VGNKS E TLS QKKKGAP S KK 
SLQGVALFLSQLGKLPPFLQRKKRPQRG 
GGRGQLAPGYNPPAGKTLVGPGPHLSFC 
SKKAWKELRPWKKPRGFWFLINST 


6611 


20512 


A 


6665 


1 


280 


FFFF*HRVSLCCPGWSTDLHLLPP*LAR 
FKRSSHLSLPSSWDYRHTPT/HPGFFVC 
LFVFCRDRVS FVLPRLVSNFWAQAI LLC 
GDRKVMGLQA 


6612 


20513 


A 


6666 


408 


89 


WLPNSLILPN*NYFVSYLSDRWSLCRP 
GSTAVAQ *ELTA\ A* TLGLKRS AS FSHS 
HSCSLPSSWDYRCTSSHLANFFFKRWGF 
TILPRLILNSWPQVXLLPQSQRH 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


6613 


20514 


A 


6667 


484 


208 


I T WS AAGL IHYS FL-NPGE PMTTEKYAQ 
EIDDIH*NLQHLQPALVDRKGPIFLHDH 
A* PHAAQPK\LQKLNKLGCRVIjPHLP YS 
PDLLPLT 


6614 


20515 


A 


6668 


495 


227 


PAFVKVPPEKDQRQNSTHAGRNHLPME I 
LLL * KKKL IETESHSVT \ RLE CSGVITA 
HCNL KLLGS SDP PTSAS D YRLKP WHLAP 
SSFSMS 


6615 


20516 


A 


6669 


2 


65 


DGVLLTLSLLARLVL\NSWPQVINLPWP 
PKVLG*QAGLGTPGLK 


6616 


20517 


A 


6670 


403 


224 


ILL* P * PPGLK* / FLPSS WDWRRASLCL 
ANFLFFVEMGYPRVARAGLKFLGSDPSA 
SQSAGVTGVSRPGRFLIRTVL 


6617 


20518 


A 


6671 


406 


162 


RSYAQAGTQWLDFSSLQP * PPGLIKQSS 
HFS FS QSASH/ RHPDNYCI PCRNRVYVG 
QAS LELLGS CDLPAS AS Q SAG I TGS CL 


6618 


20519 


A 


6672 


453 


32 


VVSATLHTSVVVCACATRCVGMCMTLGV 
CGCECVW*GCVCTSVN* \HTSTGWSEK 
V*LCVCGCV*LCVAECMAFVSRQCVCDP 
VCCLPEKMRLS P I S VA* VCAPMKVS PCP 
PQLSELHLLSLNIYINKCVSLSQNAWLF 
L 


6619 


20520 


A 


6673 


378 


180 


NEVIRSYKVSLFSPRIEKWINK/I/HWY 
IYPMEYDSAIKRNEVLIHATT*RKLENI 
TLNERIFDIIPFI 


6620 


20521 


A 


6674 


378 


94 


P VKNRRTKELH I KD VCTPML I VI LFM I V 
K/S * KQPKCP * IDEWMNKMWYTHMVE YC 
S VSKKEGSPF I CYNEDITLNE I SHKKTN 
IYNSTYIRYLE 


6621 


20522 


A 


6675 


1 


380 


VCSL*VLCRHMSCFQNCSL*FSFSFWFF 
WRQCLPLSPRLECGGPMSAHHNLRLPGS 
SDS CAS AS L VAG I RGACHHAR VI FVFW 
EMG\FHCVGQAGLELLT\*VICPPWPPK 
VLGLQVLATVPGQPHIF 


6622 


20523 


A 


6676 


424 


135 


SHRWGVKCFKYNI FFFRD \RV* LCCTGW 
S*TPELK*SSCLGLPKCQDYRHETLRVS 
Y*F*NSI*QIGVLAMVSKLVLNS*AQAI 
LLPWP PRVLGLQ I 


6623 


20524 


A 


6677 


408 


91 


HTHFVWPPVLIP* * PMR*VLLLTPNTNF 
FLRDRVS P CH P G W S VE AQ S Q I T AASN \ Y 
GLKQFSHPSVPKCWDYRRELPCMVSLII 
FIYVHLHMYVHARKEKPEYPL 


6624 


20525 


A 


6678 


1231 


217 


PPYFKNPLTFPQKRFFKGDKTLKKWL* I 
RE 1 1 LQNRGIG * KTDS IR * MGKTEAPLK 
EKFLLGTALQPSFLPYMLGHFKEE*GEE 
F PGTHS W P FP S F P VWGLGL S VNHRG I HG 
PRVSLQTWGPGQEAMECECGNHGTEDTG 
\GAVLQPG*VT*GFHTPTSEPGLVAPPC 
TCSGLKAEA\GFLKLPGKQG\SEGALPP 
* S PL PS ACFPELGLRRLGS S PGLMIHPG 
WGL FTS PLGLKI LG PRHP AG PAL P Q I AQ 
RPSLKFR *RRLGWGPELLPAQTCGAQSL 
GPSTAPLLGQGPWGLG*HASRWGTLGLA 
VAPGNEVPSLSPR* PRGAGPLHRHRVPV 
RSWG 


6625 


20526 


A 


6679 


1 


178 


RGWDYRRPLP/RPG* CFVFLVEMGFHRV 
GQDGLYLLTL * S S Q S AGI TGVSHRARLQ 
PPIF 1 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G^Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S==Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, \=possibte 
nucleotide insertion 


6626 


20527 


A 


6680 


3 


135 


KSEHLEREP I IEQLSTMFF/ TAKYGWHL 
RGH*HRCQKKPEPPEDR 


6627 


20528 


A 


6681 


425 


107 


KGPPTPS/PPPPSPVFPTEKILVPPPFF 
PPPPFKKPPPQNIFWGPKK/IKNFSPPR 
GEKNFF F * T P P P K YFLGAKKKK3STFS P P R 
GEKNFFF 


6628 


20529 


A 


6682 


424 


284 


CFLQPGP \PGS KGASPPS FQDNWGLRQA 
LPPP/RESFFLFLFLFFPIVQTGFHHVA 
RAGLELIiTSSDLPASASQSAEITGVSH* 
A*PFP 


6629 


20530 


A 


6683 


1290 


449 


PLSAPPPGDPSP*KSSPRAPNLPAP*PP 
NLSPWPPKCPRPRR*PASS/PP*LPVPP 
KS P S P P V\ PWATEAS RGHQ PQIT*ELG 
SLVLRMTEKGTWG*GIPDSIMPLPRRRS 
EP RTCRE * GDTRWRWRRRRRS TG AVS T F 
SPGFACGGGSGA/ PAHPGEQKLGASARP 
DE/PVCPAAWVLLPSARWASPAAAPPP 
LPSQPVH*RFKP\SPPY*PP\ALPPLSP 
FEPSAPPHTGSVPY*QGVPP/SEGCPST 
SKPLISVRPPVTINGCLPQSQPPKTHTV 
SRLYEEN 


6630 


20531 


A 


6684 


425 


40 


E WAS LKN* HP P / GANQAVEAVE APTHCW 

* GCKS I EPLWKTVWQCLDKLNI CLLYDS 
TPE*MSEHVYQAACKKSFTAVLFMIPPN 

* KQPS CPSTGWLGQAWWLTLVI PAPWEA 
KVGGSLE PRSLRPAWATG 


6631 


20532 


A 


6685 


2 


107 


SWLTAASTSQ\VKRSSRLSLPSSGDHRC 
MPANFFFLKDDVSLLPRLISNSWTQAIL 
LSLPTKGLEL*AHCSLNLPGSSDPPASA 
YQWG TTGACQIi I F FF 


6632 


20533 


A 


6686 


12 


224 


TPPGSTHASGAPPCLANIKFNY\L*RLP 
RLFWNSWPQVILLPWPSKVLGLQVWATV 
PSPFLIFDRDIDREI 


6633 


20534 


A 


6687 


404 


2 


TAARG 1 1 YGGP I KKLLPRKPADVK/ SGS 
FKTAAPFFFFLGTRTLTEFLFI I IQN* K 
QPKRPST/DRINKLWYISTIEYYSAMKR 
NKLL I HT TAWMN Y KG I MVS KRGQ S Q KS I 
FSRTKTGFGRNHISGCFLYPLPLRL 


6634 


20535 


A 


6688 


410 


163 


PRGNTPPGFSPPEKENQKVFNPPPGNIV 
*KKGGPPPQGAQKKGPPY*GPPLSFLPP 
KKKKKKGPPPPPPPKKNPPKGNS /SFSP 
KKKKKF F P P P P KKKGF * GGP P L KTL FFW 
GGGKKF F F FFWGKKE F P LGG F F FGGGGG 
GGPFFFFFFG 


6635 


20536 


A 


6689 


205 


405 


KSPPSPGGPPLIGKLEIGGAPKPKNWPP 
PLGPKGENPPLALGGGGGAKNHPPNPQF 
ILLFLGGKNP*KSPPSPGGPP/ANRKIG 
NRRGP KTQ KLAP P LGAKGGKP P P C P RGR 
GGGQKPPPQSPVYS PFFGGKKPL 


6636 


20537 


A 


6690 


3 


199 


HGSLQH*PPGFKSSSSLSLPSSWDHGHV 
QPRPAIF/CFFLILEKRGVSPGFPGGPE 
TPGLKGLDLL 


6637 


20538 


A 


6691 


8 


350 


£ VCLGLP/RCWDYRYEPPHLA* LAFL CQ 
1 1 FHHMD W I YHVS I LWDGYLGCFYFLA 
I INNV\G*AGLELLTSGDPPTSASQSAG 
ITGVSHCTHPYCYTFLTREITTPKEVKI 
GLCM 


6638 


20539 


A 


6692 


373 


197 


LPSSWDYRFMP*HMLFFK\FLFLLFCRD 
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E/SLTMLSRLVSNSWAQATLPPQSPKVL 
GLQV 


6639 


20540 


A 


6693 


369 


100 


PMLARMVSIS*PRDPPASASQSAGITGV 
S HRAR P I Y FF K * R * S HAG YS AVP VHRHD 
PTTDKHNS FLFCFLQTGS CFWQVGMQW 
HDHISL*PRTPGFKQYSHPSLPRH*DYK 
PTPP/HPS*FWQVGMQWHDHISL 


6640 


20541 


A 


6694 


12 


182 


FLSSWNYRCVPPH!LAN*KKNFFFYTDR/ 
S LAVLPGLVSNS * TQAVLPLQP P KVLGL 
QA 


6641 


20542 


A 


6695 


402 


248 


S S WDYRCAP PCPANFYKIL* TGS / LLC * 
PRFVSNSWDQVILPPWSPKVLRGV 


6642 


20543 


A 


6696 


301 


309 


VQEF*DRILLSPRCHPGWRAVA*SRLTA 
ISASRLKARPSHLSLLSSWDYR*VPPGP 
ANFLHFFCR/DRGFAMLPRLVSNSWTQV 
ICLLRPPSVGITGVGHRARSILLD 


6643 


20544 


A 


6697 


3 


387 


PCLANFF/ VFFVETGTHYAA* CGLRLLG 
SSALPV*TS*SAGIIGMSHCTC/LQITL, 
LKTESHS\VAQAGVQWHDLGSLQPLPPG 
FM* FFCLS I QS S WDYRRTS PRPANFC I F 
SRDEVLPCWPGWS *TPDLR 


6644 


20545 


A 


6698 


1 


287 


LWFLYQGNAGITGISHRAQPNS FVFFKD 
RVIiLCCPGWSAVA\HSSLTVAIiTSQAQV 
I FLtPQPPG* LGNFC I FSRD / RGFAMMPR 
LISNSWAQVIRPP 


6645 


20546 


A 


6699 


412 


3 


FEMESCSVARLECSGTILAHCNLHLLSS 
S S S PVSASRVAGTTGACCHTWLFFFFFF 
V/FLVETGFYHVAQMGLNNWAKAILPQP 
P*VLGLQARITTSSPNSSFTREHTSGSG 
VGYPAKFHS I VLLLL YLLFHLL CGVF 


6646 


20547 


A 


6700 


393 


88 


QKLYPYLTPYAKVN*QWIKNLSVTAKNI 
* FLGENTGVNLHS FRF I KGFLDM / TPKS 
ISTKRK*KIKWSSSKL\KNFYALKNNIK 
KMKRQPTKYEKIHLTLNNWV 


6647 


20548 


A 


6701 


494 


239 


KAGSHSXVSRLECSGMISAHCTFCLLS* 
NNPLTS ASLAAGTTGVHHHAWL I FVFS V 
QMGFCH VTQAWVIi I FDCGFES P INIKYN 
LL 


6648 


20549 


A 


6702 


2 


379 


RPTWVTWQNPASTKKKKNFFFFKTKSPT 
VPQAGGQGGNLG*RKFLPPGLKRFSGLK 
RPEAPKGQS CSLMGGGLCPLLP P / WSGT 
PPQR 


6649 


20550 


A 


6703 


220 


47 


TGFSGSQIN*EK/IWLGTVAHTCDPSTL 
GGAGGQ VT * GQE F KTS RHE PLHLT VL CS 
LF 


6650 


20551 


A 


6704 


80 


414 


ICYYFIVFAIYLNQFFSNCYRSFTFFFC 
PFETESRSAAQAGGYNPTLTLLHPPPLG 
IKQFSCLTLPSS\WARS*KPPCPTWGNS 
VFVGIYKNKLGGGGAXRRVQTCNQTYSG 


6651 


20552 


A 


6705 


69 


416 


ISTVEKRNNIHNIKKTPVYFKGFLLYCCKA 
E VF YVLDNPF I FYFL F YFFETHCVAQAG 
VQWHNLRS LQ P SHP \ S S SDYRCAP PHLA 
NFCRDGVSPCWPSW/S *APGLKLIHPPW 
SPKVLE 


6652 


20553 


A 


6706 


373 


75 


HAQLTFKFYYFRNTFCKAVASTKSHSSD 
RSE*SKLKTFWKGLTILDAIRNISDSL\ 
EEVNMSTFRGV*KKLTPI IMDDFEGFDF 
TRESNHRGARHGGSRL 
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E=Glutamic Acid, F=Phenylaianine, 
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K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
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T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
XHLTnknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


6653 


20554 


A 


6707 


331 


379 


K* KINKN* KGLT I LDAI KN X CDSWEE VN 
I S T / LTGVWKKL / I TL I DGS EGFKI S VE 
EGNAGWEIARELEIEV/EPEYVTKLLQ 
SHKKT 


6654 


20555 


A 


6708 


578 


152 


LQQHNPGSLQPLS P \GFKRFS CLSLPS S 
WD FRHAP P CS AN F F VFLVERG YHHVG * A 
GLKLLASSDSPTSASQSAGITGVSHRAQ 
PLIGFKVNSVKKSVSVPTRRILKGNMSM 
YEQL 


6655 


20556 


A 


6709 


392 


146 


FFETESLSPGLECSGTVSAHCNLGSSDS 
PALPS /RLLGSSDS PAPAWL YSFKL * I 
Y/ FHKTCNYVLKINILLHVAI PYSFSWS 
C 


6656 


20557 


A 


6710 


9 


393 


KSKPCN*DKSQSYRL*LFAQLFTF/SNF 
FYFLETF*ESHCVIQAGIHSGTIMAHHT 
LKLQGSSNPPASAPPSSWDHRCGPPPLA 
TFLKIFL*RQWAHYVSQARDNGLIMFPR 
LVFELLGLKRFLLPQPS 


6657 


20558 


A 


6711 


34 


387 


QDFGFLSFFLLFFFEKGLGGPTCRAGPE 
SNLME \GGPPGLKQFSGLTLPRTGNNGR 
ASRSRAKFWN*KGKGGPPGPPVELETFT 
LRGP PHLS S PKGKNNGGGP KPD FFKKVG 
KKAPNP 


6658 


20559 


A 


6712 


1 


188 


IQAWWYTPWPATWEA*AGGLR\EPRRS 
RLQ * AVI WLHS CLGDRVRP CLNNKKGS 
FPPQPLW 


6659 


20560 


A 


6713 


377 


98 


SLGLKQSSHLNLPSSWDYRPVPPCPRNW 
FVPVP PFF\RE * GIoAML PRLVSNS*AQV 
ILPPQPSKVLGLQARTTAPGFFSHLLKT 
LVPNNVNLNI 


6660 


20561 


A 


6714 


353 


229 


SS CLGL P KS WDYRHE P PHLAYF YY* * RQ 
ALV/MLPRL*Y* FLD VNM * L P L P FHVAP 
SHLLNSVGMV 


6661 


20562 


A 


6715 


371 


64 


ENHGDCTPAWATRAKLRFKKKT/ CYQMF 
ITTLFTIAKRK*LKCPSSDK* INKMWYI 
HMME Y / YAI KRNWT L VKLNQN P YAL * KV 
LLRE * KQNP * S VRNI FKTHI 


6662 


20563 


A 


6716 


462 


232 


HFNLITIKIIPFF/C*DGGLTMLPRLVP 
NSWAHAILLPQPPK*LGLQARACKL 


6663 


20564 


A 


6717 


425 


48 


GFGGPPFKKARFGFLHPF*TPPVLRPRP 
P P VF FGGF KAS P P KKKTP K/ P PGGGGP P 
P YFHP F FL S PFFP I F FEKE P P P FFFWRG 
LGPPPPPFKNPFFFFFWVEGGKFLPQKK 
KKKF FQSIS TRSVIHK 


6664 


20565 


A 


6718 


3 


413 


GTFLTKTS LCGTLLY* VI S YMCV* FI S * 
\ S YL VY I I Y FKD I C V / LL YM Y I F YI YT C 
VHISVCMYMYVCMYVCMYKA* PANLKTF 
CTQKHLAWGWVQWL I S VI S ALWE AEARE 
S PE VETNIiAQLGPTLSLPRKKKKKKKK 


6665 


20566 


A 


6719 


408 


71 


SSWDYRC/LPPGPAtfFCIFSRDGVSPYW 
PGWSRTPD*RSLPGGDVMPALiVIRHVPE 
LLKKQHSSVRSSRPAWPIW 


6666 


20567 


A 


6720 


375 


57 


SVIREMQIKTTMKYPFIPTGLIVIKKTE 
NN/RGWQRCREIKACIY*WWGCKMTQLL 
QKTIW*FLKKLNIEL/PYDPAIPPLGM* 
LRELKTCLPRLGWAHACNPSTLRG 


6667 


20568 


A 


6721 


391 


164 


R I SETCT * MFPAALF IV/ DQNNPDKWTN 
LMWS YTHTMDDYSAI KRNEVLLHS S TWM 
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nucleotide insertion 














\ENLENLMLSKRGQSPKTPFG 


6668 


20569 


A 


6722 


381 


278 


SSLQPQTPGLKQSSCLSLSNS \PPRFID 
*SISLYRQGLTILLRLLSNSWSQAIVI J P 
GP PKALGL * AS KKFLYSN I 


6669 


20570 


A 


6723 


2 


363 


QRSLEHRPCKKC/ CGLW* VS * KVQYLV 
YKE S S S FKNLLRR IHCNFLFSG X VFHF * 
KLLRTPRSFCLCGL.HI P INFT I F* I KI E 
KFKKC* PGTMAHP YNPSTLGGLGGWIT* 
GQDGLDLLTS 


6670 


20571 


A 


6724 


352 


58 


HGGAIGARFFKVFNLI* FNLVCKDE/ S F 
SLLARAVINSWPRVILPPWPPK\WGFQ 
AKTPLPRLKGFLESLPSLKKKKNLAYAR 
TITAQIGKLVCFIDQQ 


6671 


20572 


A 


6725 


383 


54 


KSSPFFFFEQGLLCCPGWSAWSPLQPK 
CPRLKQFSPLS/LPSN*EYSCTPTTFSL 
HVCVS IHKYI * YIDQYI FKFF/ CR/DRT 
L AML PRIiVSNS W S QAI L P P * P P KVLGF Q 
V 


6672 


20573 


A 


6726 


3 


343 


IjE V* ARATRQE * KINGT * 1 1 KYKKVKQ Y 
SQNDMILYIQNPKDSAKKLPELVNKFSS 
FKVNRQNS WFL*3STSNKKS /EKEIKKKL 
/ PFTI AS * RI KYLGINLTKEVKNLGRAQ 
WVTA 


6673 


20574 


A 


6727 


3 


615 


HTSQGNL * IQYHT P WHP P S S ERVERMNQ 
TLKSHLTKLVLKTRDSWTKCLP IALLKV 
RT VPQKE VGLS PCEMLYRLP YSHFTVD I 
PTFETKSQFVKSYVLGLSSTFSSLKAK/ 
ELFSTDATLGSSPGTSAFSLGDDVLIRS 
WEEGKLKPAWEGPYLVLLTTKTAVQT/D 
NKKMQ THHTP VKKAS P S S KS CA I VP R P I 
PTKLKI KKKKA 


6674 


20575 


A 


6728 


2 


228 


SGAI / SVHCNLHFPG/SSSDSSASAS * V 
AGTTGVHTMA* L I F VFLVETGFHP I S Q A 
GLELL/NHMICPPWPPKVLGLQL 


6675 


20576 


A 


6729 


98 


359 


KVE YTVLLF*GFKGKFPTIiPL YIjSFS S F 

iiknhlisnfflnrvllcypdwswg*ft 
ahcnlnlsclslpstwXdsrcvptgawl 

RFI 


6676 


20577 


A 


6730 


10 


428 


RSTRTDYAALREVLPDPRRCRPRTGAWS 
NGRRDRPGAGHSGPKAEAETRETLEKGL 
LFCCPGWSAVA* PR P TAT FAS RAQSLT V 
SPRLECGGWSAH*NF\SFHFQRWGLAV 
LRTLVLNSWSLALLLLWPRKVLGLYALA 


6677 


20578 


A 


6731 


425 


158 


SQTHDLR* STCLGPKCWDHRCAPPRPAL 
FLF/FNKCPMS IRKIENAQ* LPGI * KKN 
FCRDDHLPLLSRLILNSWAQGIL*CRPL 
KVLGL 


6678 


20579 


A 


6732 


431 


22 


RDRVL L CH P E * SAW * S * LT T VQNS WAQ 
GILSSI*DYSCVP\LPVNF*TFFKSVYY 
YFIIFIFE I YFCNS KALGL / LGVSHRAR 
PPRLVLNS WPPQVI LPSQP PE * LGLEAL 
ATAPAI PRNSNLVWHQYSFKS I PDDL 


6679 


20580 


A 


6733 


44 


414 


AIYDLPFLLIDLYPMELRAS IRAHTCT* 
RFVATLFTVTKRLNQPTCPSTDE *QNKM 
WSL/YIMNYYPAIKRNEILIHENIMLCK 
RSQSQKAHTE*FHICEISRISKSAETED 
ELWATRWGKGRE 


6680 


20581 


A 


6734 


375 


49 


QRLTYHCRFS YNTASNKMRLS * TQMHQK 
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sacsmcpgrl*gvhamrpkvlmrlsrsh 
/varpiaatcaq*vcdrik/r/ slliee 
qkiwkvlkaqtqsqtakfknetffrnk 

N 


6681 


20582 


A 


6735 


381 


88 


SLFPRLETPGLRPSFFFFLFSLSH\ PGW 
SVW*SRLTTALNS*ARFSLLSWCVPLH 
QAHFF\FFCRDR/ sltmlprlvlkswaq 
AIPPPEPLKRSGLQV 


6682 


20583 


A 


6736 


384 


293 


AHLSLLSGWDCRCMPPCLDNCVCLC/RV 
F VCR * GQAMLPRL VSNS WTQVI LLP WHP 
KVLGL * AQD QE P LWGRS G P GL PQAAS G 


6683 


20584 


A 


6737 


24 


300 


NILSAFLCLS*P*H*YGLVIL*NVPQFR 
FLPLFFPMQMYTHIFFKIRIRPGWSHA 
YNP S TL VGQGGR \ S HE F KT S LDNI ARPR 
LYQVQNKIR 


6684 


20585 


A 


6738 


2 


353 


IHC W * KCK I T YTLEKT WQFP I QLNIVL 
PYIPA/IPLLGIHTREMKTHIYTK/T/C 
MQMFGWLFI IAKIVFQLVNR* IKL* S I 
HKVDYHS E/ IRNKQLYEAT * LNLKCT VL 
SE*S*TQKTT 


6685 


20586 


A 


6739 


2 


336 


IHATYTNP * HRH \ VHTPTHVHT / HVYTH 
NGTHTHTARAWTPAPPTSLILPSPTESP 
PALQIPNLPFGLSSSPHPAGSSQGEPEE 
GGCQRRAKNRVGHRAEKWETELCAGDMN 
L 


6686 


20587 


A 


6740 


3 


385 


KKQATLLWI IHLATTFSVEI *QSKIEVH 
D I FKVLKEKKK* TTFYPKI TNPGKI AF I 
HEGKI KLY/ I VK* QLRDF INSGLVLKAM 
LKRVL* S KRKGH* QTIRNLRVQNSLVTA 
NTQPNTEYCNTVIWGN j 


6687 


20588 


A 


6741 


406 


155 


TGSCSVTQA* /CWSAVIVHCSLDLPGSS 
NPPASASWIAGTTGMCYHAQLPSFNLCL 
QFYCMGCVFSRFYLRFTFGLQSYTRKQK 


6688 


20589 


A 


6742 


1 


215 


LPYSPDLSPTD/Y/HFSKHLDNFLKGKR 
FHNQQDAENAF * KF I ESQS I DFYAVR IN 
KLISHWQKRVDCNTGSYFD 


6689 


20590 


A 


6743 


376 


109 


YHMEVWNLYSLKGSSYLRLPSSWDYWCP 
PP * QANF* K\ FFCRDKDLAMLPRWLNS 
*LQAILLLRPPKAEPLQLLGQFQFRRES 
GWN 


6690 


20591 


A 


6744 


382 


148 


NIPQFMFDILFQ*LITLSCPASF*SKFP 
LSSLYFIQ*GR/WLGTVAHACNPSTLGG 
RGQR\ *A*DQELETSLAHKARPHL 


6691 


20592 


A 


6745 


403 


209 


HIVINRI IRKYCA*HYANKFNNLDKLDT 
FLE\NTTCKNLTQEETENLNSPVSVKEI 
EQGKLCSDRN 


6692 


20593 


A 


6746 


366 


40 


QGSVLS I LNVKI VFWFKTGCVHFLFPLN 
FLSLFQNYVCSHTS/YPLSLIPHESLWS 
FPPSFNHSS*Y*HFLKTN*MPDAMAHAC 
NPSTLGS*GRWIT*DEEFETSLANI 


6693 


20594 


A 


6747 


392 


57 


APKKKGGGVYPLYPKKSGRKKKPEG/PP 
PKPKKKLKTPGRKNDAKDIKPGGAKFFF 
SPKKKRGAPPGVPPRLF*KPPQATI*GG 
PKKKKKPPQEGGKKGAPPKAPPLFFFFL 
YF 


6694 


20595 


A 


6748 


376 


140 


LEDNI /GYSNQNAGAGRVS * INTPKIKV 
S \ Q S K I AKWD L I KP KRL CAAKVS MNRVN 
RQP P E WDR I FAIiAS AD\RGFLNK I LG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=0Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine ? 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6695 


20596 


A 


6749 


540 


172 


FFLRQSL / DS VAQAGVQWRHLGSLQAP P 
HRFMPFSCLTLPSS*NCRCLPPCPANVL 
YF * QRWGFTVLARM VS IS* PRDP PAS AS 
Q SAG I TGMSHCAQ WNTLS LHAFSCDKV 
P QNDLL / WP S KS F CGTL SHE KACKDNVF 
TTWAQWLMPVIPALWEAEAGGSRGQE IB 
TILANTVKPHLC 


6696 


20597 


A 


6750 


1 


353 


WFLERESTRSENSVNIVEMTT*DLEYYV 
SKAVAGFEKIDSNIKRSFTVGKMLSTSL 
H/CYREIFHERKSQSMCQTSLLSYFKKL 
PQPPQPSASATLIVNIKAKISAKRIKLT 
ECSDDC 


6697 


20598 


A 


6751 


404 


231 


SPS/PPPPYFGGPPGFFPPPVF*TPPPE 
YFFGAPKKKKN~FPPPGGKNFFFFKGPPP 


6698 


20599 


A 


6752 


77 


371 


I TCP VLYLTSRNTKTKKYYLDCCCFWP * 
WNNRDQTYLHLFFF/CFIKKNFFFQPKR 
GARGGFCINETPPLGGKGNPPPPPPGRG 
EKRAPPKTLCFFFFF 


6699 


20600 


A 


6753 


354 


169 


ETE S HS VTHAG VQ WHNLG SLQPPPPGFK 
RFSCLSLPKK/WFSVKLN/PNT*SFCYH 
ILKALII 


6700 


20601 


A 


6754 


1 


442 


CX CR TAFE LTGRATRS E KEERKKERM YD 
* KKERKKEKEGILLKKDGQEEEEGRKEE 
RKEGRRQTQDE FGGQNRL S T VKQGS * PK 
KRK*GIQ\QTGAPTQERDKETSQNKNEE 
KARHDS*QPQA*RTQSRLQQEGRKLWKR 
GLiVSRRK 


6701 


20602 


A 


6755 


366 


262 


F F * RWGLTMPRKLVLNSWAQVT LLPP PL 
E*LG*QA 


6702 


20603 


A 


6756 


3 


223 


GLKQS S CFCFLSNEDCKHVPP \hrlffl 
FVCLFCFVLFFW*RYGGLAVLPRLVLKS 
WP* VI PQPWPPQMLRLQA 


6703 


20604 


A 


6757 


418 


244 


FSLLSSWDHRYMPPRMATF/SFVLFCLR 
EGCLTRFLRPV* NS WTQAI FPQS PKLLG 
LQV 


6704 


20605 


A 


6758 


46 


256 


KLKNKTEARHS S S ACL* /LPGS WDHRYL 
P PRLANF * TFWRDRVSLLP KGVSNLWAP 
TILLPWPPQVLGLQV 


6705 


20606 


A 


6759 


391 


55 


F I FFFFPHSSHGVS PCCPGWSRTP \GS S 
GS PAS AS QHAG I TGRNHHTRLQD I Y* LE 
/TGQTVTLQFSVLADTTLTKRLSWS ILC 
CYNKIPETG 


6706 


20607 


A 


6760 


377 


2 


FGEPPGFSPPPVLKTRPRNLFLGPHKKN 
INFPPP/WELNLVPLKGPPFFFFFLR*G 
LAMLPRLVSTSWAQVILLPQHPKLLELQ 
VHAMVLSSRTSHSPRSRSLSPRFSQRLL 
PVSCNPAQPSPMA 


6707 


20608 


A 


6761 


2 


259 


LTI A* KNF I FLGL * QGICQ I PKDYQTS L 
R* IKDNLNK/WDTPCSWIGGLNIVEMSV 
LSKXMYRFRAIQMKTTSGFPFFFFWKKG 
IFF 


6708 


20609 


A 


6762 


1 


169 


FFETEFRSFA*AGV*WHDLGSLQPQPSG 
\ S SDS PAS AALFFF FFFLKKKI FFFPPG 


6709 


20610 


A 


6763 


1 


496 


MATCTPKHINTQICVSFLLSLKLSEFEQ 
MS I FKKGENELALGTGNIRP YDS CPGNR 
P/APFATSPAPRHPSGPSWRPPTSGGVP 
ARAECPAS PNPLPPGFCGQS PSRGRKEE 
*LRGESTSTLAIF* * APNRYHS F * LPRQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 
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NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














P * PHGD CP ARKKKKKPND PRLQPTVT 


6710 


20611 


A 


6764 


1 


229 


VLLLCFFFFVL* DRVLVGHPGWSSRLYL 
KTKTQKKQNNIFVCRVGG\LTTLPRIiVS 
NSWQMVhS PQPLNALGLQ V 


6711 


20612 


A 


6765 


580 


273 


WRQSLTHSVAQPGVQWHDLGSLQPPPPG 
FKRFSCLSLPSSWDYRRAPPRPTNF\*Y 
LVQTEFHHVGQAGLELPTSSDLPTSVSQ 
SAGITSVSHHAQLDMIFLR 


6712 


20613 


A 


6166 


220 


239 


AQE FKAAAS T TAL Q TE TL CLKKKKKKKG 
GVSLGGVSSNFRTGGGASVTSPSIF/CF 
FFFF*DRVSLSGVQWCSLQP*TPGLKRS 
SSLSLPSSWEYRHVPLHPA 


6713 


20614 


A 


6161 


89 


387 


FLSPSGSQKEHLQIQLRSTFSFFFFFKR 
EPRFGRVGGQCPNLG*WKPPPPG*RQFS 
GLTLT / S NWE RWAT P P P P TN F W I FKKKG 
NS P WWKRGAK I FGP WD P 


6714 


20615 


A 


6768 


397 


223 


IREVGWINWQHKDPCD/ KWNCSVY* L* 
* W I HKP THDKT \HRNTHTHTHTHTGMHS 
HTQK 


6715 


20616 


A 


6769 


3 


367 


SWKVCSKRPIPSPSLFF*KHPRVX 1 LRAD 
GT* *VSLATL*HMQLS*AGSVKEAEPRS 
NAP PHKATFL FL F F CTD / S S LTMLP RL V 
LNTWPQAILPPCPPKVLGLPARANSSSL 
SVFKLTSFCC 


6716 


20617 


A 


6770 


490 


260 


FFETESCSW*AGVQW\LELLTSSDPPA 
SASQSAWITGMSHCARRCGTHYHLCERF 
L VS F VTALLS QLLELGLAHSRC 


6717 


20618 


A 


6771 


418 


130 


PSPRDLNLFPLNGPPLFFFFF*DRVLLC 
CLGWSAVASN\ FDLK*STRLGLPKCIiDY 
RHEPLCPAMWDTLSSLi*EIFGLFCHRPS 
LPAPRTGPGSQ 


6718 


20619 


A 


6772 


311 


314 


FAVGELFLLLHHVGHACLPFCHDCKFPE 
ASPAMIiPY/QPVEL*AT*TGIEVLTPGC 
RFQAVFGLKVGFHQGPTPVYLGI 


6719 


20620 


A 


6773 


465 


283 


CPSLPSSWDYRHPPPRPANF/ * YF*YHV 
GQPGLELPTSSAPPASASQSAGITGMSH 
RAWPK 


6720 


20621 


A 


6774 


410 


91 


RDHSKLQPQTFRLKQSSCLSLPSN*DYR 
HMLPCLAGWIAGWLACQTDRQTDRQTDR 
QTDRQTDR*TDR* IDSRDR/SIiAILPRM 
FSDSWPQVNLLPQ\WPPKVRRLWA 


6721 


20622 


A 


6775 


409 


253 


L I FAFF \ CRDGGLAILLRLVSNS * PQAI 
LP PWP P KVLGLQARAPGVRPDLS 


6722 


20623 


A 


6776 


3 


278 


FFFFSLHLWAPIaAFFFFGAISAHYNLPI 
PG \SSDS PASAS Q VAGTIGACHHVQL F \ 
VFLVEMGFHHVGQAGLELLT\ * VIHLPQ 
PPKALELQI 


6723 


20624 


A 


6777 


79 


442 


RDEDRGLERRC VS ACS TPQLGNASTT * G 
CPGVSDPSLHILF/CFFKEN/ SCFAPQA 
EGGGHNLG*LEFGPPGLKQFSCLTPQRI 
WNNGGAP P P PLYFF FLKKRGGS P LWPGW 
A*TS*PRS 


6724 


20625 


A 


6778 


77 


417 


HRGLHLSFFIiF/SFLFFFLKRAALLWPR 
EQGRGW I WVNCTLS LRGQG I S KAL P S * E 
AGTKGGRPHPTLFLVF * KKTS FSQVDQG 
GIDLWARGNPPPGNLKWFELQKETLCPG 
HIF 


6725 


20626 


A 


6779 


62 


235 


I QELR I REGKCLAYEHTARNGKS QDDHA 



963 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine> S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GLSCLCCCCCCCC\CNCWP*STRKFFTK 
PE 


6726 


20627 


A 


6780 


908 


566 


SKMEFRSCCPGWSDLGSPQPL\PLGFKQ 
L\SCLSLP\RAWYYRHAPTHPANF\*FL 
VETGFFHVSQAGLELPTSGDLPTSASQS 
AGI IGWHHAHPECVFQKQLSEANLRRI 
WNQED 


6727 


20628 


A 


6781 


11 


483 


TTALLKAVRH/G*HLSLQRLLLSF\VCL 
CPAPRGGAYRGRQAS* SCGGLHPVRASR 
LLCLPNQAWAMAGAPPPTSIiPPCSLI SD 
CCASNQRDS VGVGPSE PHAG YNLP VCRF 
LSPLEKRS IRVGVTRFSRCRLSPLSLTR 
KGNSLTPCASRVSSASPCS 


6728 


20629 


A 


6782 


225 


589 


SRRSEPGNFRSCLGVGVTCCMCLSLSLP 
SRSFLRPLFLSLSVCVFVCLGTRVPCAP 
EGGFLARRPFFWSASPRVSAWVLWPVGN 
RFPGGSGLGVCEGLGNVGI \ WSEPQGFS 
FPPHPEQPLC*AGSRRALPNQGQR\LPG 
AHRLPAGGCPQS FKKVWTPVALCPHRE 
M* PQLDAGTEKEAGKGMGQACLSLKGWP 
SWPSHPFDTPPRMPWVAWPPRILPGLW 
PLL* PPAGRNARRKKPETDRRGRSAGSQ 
PQACADGVRVS L P KSGALVMS RTGL P VC 
PWAALS P VARSRGDQNPATSGAAWGWVIj 
PAVCVSLSLS LLALS FALS FS LCPS VCS 
CALGHVCPVRRRVGFLHVGLSSGQPLPA 
SLPGCCGRLAIVFPAVPAWGSVKAWATW 
ASASEPQGFSFPPHPEQPLC 


6729 


20630 


A 


6783 


458 


170 


PCI VTAS AS Q S AG I TGVS HRTRLLVN F S 
KVG S VT I S LG * E VGLRHW P / N I / GRLD K 
TGPGE KQHFI LHAP QS AP SVYHCLGNTQ 
ELPPLSTAMISQPKR 


6730 


20631 


A 


6784 


1 


192 


FFFFG* ILPLSPRAGVQWHDLGSLQPPH 
PGFKQFSCLSLP\RG*DYRACATRDGIiI 
FCNFRLGD 


6731 


20632 


A 


6785 


3 


246 


SLCVCQYYCRCACV*VPKQVSG*VTMGI 
/CATGFVLIYLCG*QCLCG/C/C*EDPF 
YCEQLCVHVCS C VI.GRGWS VCVCISRIiL 


6732 


20633 


A 


6786 


408 


128 


GPVPYKARG*SPFWPPGEKPSFPKNPKT 
PPGGGGAVLFPPPWRGRPENSPPPGGKR 
C/QPN* IFFLPPHPGHKTQGPPPKKKKK 
KISAEFLSW 


6733 


20634 


A 


6787 


1 


418 


LT I LGVMKNI CD S WKEVKI S AFTGAWKfC 
LIPTLTD/EY*GFKASVEEGTAHVLETA 
RELEVEAEAVTEL/LQSHDQT 


6734 


20635 


A 


6788 


3 


237 


FFFL T * VLT I V/ PG * SAEVQS * LTAAS T 
S * \ VK* S SHLS LP S S WD YRRMP PHPANF 
FFFFGKKSLILHPGRGGPSLPTP 


6735 


20636 


A 


6789 


282 


3 


PHPDNF \RFL V* TGCLHVGQNGLALLS P 
GYPPTLASPSAGITGVSHCTRPKRYF*M 
VLSNKSLNRFGLCTVLPTCLQAPRRTGP 
PPATVLGRGT 


6736 


20637 


A 


6790 


62 


328 


FFRDRILFCHPRWSAVIQ\SWLTEASKS 
W \ I K* S S RLGL P KC WD CRHE P L CLDNTT 
Y*CRQNTLPITLIGCFRVSLPYAKYTGK 
FWGAEG 


6737 


20638 


A 


6791 


67 


353 


TNKHTYFIiELNKREKPQLTVFFFF*REIi 
RGL ILANGNFRLRG* SNSPA* PS KVAGI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
KNLysine, L-Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *-Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TGP PHQAGLI LVFLEKTGFNYVG / RRGP 
PPPPPKELALQA 


6738 


20639 


A 


6792 


744 


382 


MTFKKTAHIYFYLYLFIYLRQSL/DSVA 
Q AGVQ WHNLS S LQAP P PG FM PFSCLSLL 
SSWDYRRPQPRPA/NFFYF* *RRGFTVL 
ARMVSISRPCDPPASASQSAGITGMSPR 
AWPKLHIFKL 


6739 


20640 


A 


6793 


372 


129 


RVLiP CHSGWTAWQS * L IAVS TSQA\ KR 
SPHLS IRHVPPHPA/NFKQFFCRDGLIM 
LPRSVSNFWAQAILPPQPPKVLGLQA 


6740 


20641 


A 


6794 


395 


25 


QPGLFCFTPRKKPDS FSKVFAVLSF/ FP 
PSL*FS/PIiSL*VQEMVSICCPGWSAV\ 
TPGLK* S S CLS LPKGRGYRCAP PHSANL 
KF F L * R * VLFML PRLGLH \ S WVQWL CQ 
WPPKGPSLRACTSKI 


6741 


20642 


A 


6795 


412 


176 


KKS* /CLATPRSENWVL*NGPPLFFFFL 
*RRDLAMLPRLVSNSWAQVMLPPWPPK\ 
VLGLQAGATMDHS C VQ YSHS FAPE 


6742 


20643 


A 


6796 


444 


95 


NFVRNCQSFLQSGCTFLHSYQQFLL/PC 
PCQH* IMSVFQILTTVIGVYMHLV*ICS 
ALMTFDLFMCHVL I CHLHI FFGEVS VQH 
FSLLLFLFESGS I YWQAGLHLLFSSNP 
PASAS 


6743 


20644 


A 


6797 


378 


732 


F FFWLNLGS SQP P P \ PAFTR \ FSHLGLL 
NS RD YRHL P P RLAN F \ * FL VE TG FRHVG 
QAGLKLLTSGDLPALASQTTGI TGVTAS 
PSLDEGVFKALPPPVGTTLCLPPWTTGG 
PFLGPAL 


6744 


20645 


A 


6798 


444 


1 


PLCFPPGGGGFLSSHQGFPPPPPKRKGS 
VGAKKKNI P PGP FP F FFFKKNFF F *GPY 
PF/SEPKKARWGPPPPGPGFSPPRSLVS 
PLGEKKKKKKIPPPETQPPSFGA*WGKK 
RAPPPGPRAGGRNS FLEGGRGQGRTGGS 
VREFRAGR 


6745 


20646 


A 


6799 


462 


83 


IEPLRPALCLS*SNPASI*KPHFLLFF/ 
CLPFLSFFFS/SFPFSFPFLFPFPFPSF 
FLIPFLSYSLCCSGWSTVA* *gltias\ 
T YGL KQ S S HL / S L PAS WFP CLTN * KKI F 
FRDGGIiTML I RLAYL FLNF 


6746 


20647 


A 


6800 


433 


185 


P PPGGRLLVEKTRGEGQLKPGQPGRQAG 
LGMRGNPGKAQPRPASGAPEMQLPARPP 
SGGHLYGCGAKNNEGGANPLLCGGTGSG 
S VAAAGE VS KS APD SGLMGNS MLVKKE / 
AGGGGE P QANQE T EN * EG * P MS PE S GAL 
LLTSPAAATLPLPVPPQRRGFAPPSLFL 
APQP 


6747 


20648 


A 


6801 


395 


60 


IEFLLFLPLSVDGRLWSHQLLPVMNEAT 
/MSIP*HAFAWTC/ SSFLRGGCLGA*ST 
LL YGGYMFS YLRNKQT I FQS S CC I VNGE 
QPRMRNRLFS*FIQHLVFSVFLIRAILE 


6748 


20649 


A 


6802 


423 


98 


RGVPY*PGGLQPPPV/REFFFFFIGRVS 
VLQAGAAAPTCRAQS I LPGS WDYRHLHA 
NMSS * FL YFV* RQVFSMLPRLVSNS WDQ 
AI HP S Q P HP AL F FT C WS VKP LATL 


6749 


20650 


A 


6803 


396 . 


194 


S LMKLAH V* VLHTDGGFMHPD S AI S CHD 
IFDSLHLTGEGSAKICRALYELIMQLME 
EHP\GEKQTTIA 
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SEQ ID 
NO: of 
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sequence 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=liysine, L=Leucine, M==Methionine, 
N-Asparagine, P=ProJine, 
Q=GIutamine, R=Arginine, S=Serine ? 
T=Threonine, V=Valine> 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6750 


20651 


A 


6804 


373 


192 


PALNPWAQANLPHQTPALSSNFFYF/ CV 
E MGLTVIj P RL V * NS W PE AI LILLPWHPK 
VLGLQA 


6751 


20652 


A 


6805 


370 


242 


KKGTSKFLFFPSYSFFSKFWYQKKVKIA 
FFFF* DRVSLCHPEWSAW* S * LSAALT 
/ CPGLQGSWGYRHVLPCLANFS\FFC*D 
GVSMLSRLENWPFCLF*PVFQPGQHGNS 
ISTKK 


6752 


20653 


A 


6806 


464 


182 


DGPHRFEPPISS*FLPPWPPQKLEIKG* 
APIiAKQIFFFFFFL*RDlVLLGC*GWGT 
VYSVYSWVSSRLTAAS\TAVLKQSSHLG 
LLSSWDYKHHH 


6753 


20654 


A 


6807 


3 


228 


YTCAGFFRRQSLCHLGWGAVARSQLTVA 
LTL/ VPPGLK* SSNLGLPKCWDYRCEPP 
HLACVNFMTVKDVCFSCILS 


6754 


20655 


A 


6808 


506 


173 


NLHLPGLKQFSCLSPPPRSRDYRRAPPC 
TANF\ *FLVEMGSRHVAQAGLELLNSGD 
SPASASQSAGTTGVSHHGQISLFKQILT 
STRISEVIFLTLLLRILHDLPCLKLFSIi 


6755 


20656 


A 


6809 


676 


397 


VSSCPRDQPGHHGETLSVLKIQKLARHG 
GGCL * S QLLRS LR * QTRLNSG VGDHS E P 
RIiSHCAP / AWATEQVS VSRKCKKRQLHQ 
KERXPEWVRS 


6756 


20657 


A 


6810 


367 


31 


QVFQELYSQDLFIYLETLSLCHPG*SAV 
ARS*WTAASDSW\VKQSSRLSIiPSSWD 
YR*VPPYSAHFDVNSSFLTYCTS PGLTQ 
KIEDSWLGTVAHACNLSTLGGHSGQIT 


6757 


20658 


A 


6811 


469 


251 


LLSSWK/YQVPPPHLVFXKIL/CRIGRE 
TASCCI * PRLWNSWPQVILRPQPPKML 
GLLSATMPGPISVFCFCFCF 


6758 


20659 


A 


6812 


16 


462 


E I IS YCGFNLHFSNDYKKKERKRERERK 
KRKKENIGE/HI*DIGVGSDILDLIPKA 
WTTKLNI DKWYH I Kl> /QKLLS S * RVAKT 
NKLKS FCTAEE 1 1 TRVKRQPMEWEKT FA 
SQTSDKGLISNTYKELKQLNSTETNKQK 
NNFDLLSADKG 


6759 


20660 


A 


6813 


492 


243 


S S WDTGCTPSRPD \ I FVFLVETGFPMLA 
RMVL IS* PRDPPASASQSTLVDTSRYKV 
YQNIHQQWMF *NYGWCFFD I FTASVGD 


6760 


20661 


A 


6814 


32 


308 


EYTLPHPYLITGPPPVPKPETYSHS * YF 
L S PNTQ C I Q \ PAP RTLNTAS S T CLS RRV 
TARPVPIYQDPSLEPLQLPPNSLTLPTP 
LRHSPCWV 


6761 


20662 


A 


6815 


97 


274 


GRFLE LKVKRG S LGLGM WHAYNL S TLG 
G * GGR I P \QGFKAS LGNIAKS CLYKNKK 
GAA 


6762 


20663 


A 


6816 


74 


376 


D FFI VYS VFTHLVN I KDVKDVL YKSLKY 
F FAI YRMGENI YKL Y I LGG INI QN I SME 
YYTA/ IKKD* ILLFAATWM/RLEDI ILS 
EISHEQKS\KRCMFSLICGS 


6763 


20664 


A 


6817 


1 


1127 


MRVTAPRTVLLLLSGALALTETWAGSHS 
MRYFYTAMSRPGRGEPRFIAVGYVDDTQ 
FVRFDSDAASPRMAPRAPWIEQEGPEYW 

dretrnmkXasaqtyrenlrialryynq 
seagshtlqrmygcdvgpdgrllrgyhq 
daydgkdyialnedlsswtaadtaaqit 

QRKWEAAREAEQLRAYLEGLCVEWLRRY 
LEMGKETLQRA\DPPKTHVTHHP I SDHE 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryp to ph an, V=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ATLRCWALGF YPAE I TLT \ WQRDGEDQT 
QDTELVETRPAGDRTFQKWAAWVPSGE 
EQRYTCHVQHEGLPKPLTPERWEAILPS 
PTS P I VGHCAWAGAVLAS WWSSGAWAT 
VDV*EGRSSGGKGG\SYSQGWRPSRPVP 
QGFWMCLSTALKKP 


6764 


20665 


A 


6818 


34 


382 


STAGAIHVCMCMCVHVSTCVCAKMCVRV 
KF * I WD I E TGAAQ L S CC VLVAQTE 1 1 YL 
QERNSSLSERECVCAK\CVCMCMCVHVS 
TCVCVCVCV* V* KRHNWPGMVAHVCNS S 
TLGG 


6765 


20666 


A 


6819 


1061 


305 


FFLRWS L / DS VAQAG VQ WRDLGS LQAP P 
RGFTPFSCLSLPSSWDYRRPLPRPANFF 
YF* *RRGFTMLARMVS IS * PRDLPASAS 
QS AG I TG VSHRARRRNT I LY * HLNF V I K 
S HKP PD I VFRDL * G PRLHL ENP VL * GAK 
SHSSCPWLNLSPPPPHAGSHCLSSSSPR 
MGARQDLLFNAS SHTGLFPFLQLNQTAS 
RPTSQVPKSLCFPLPWVQPFALPWVRQD 
EGRASWSPQDPPRVPSCAGFTHSCNLS 


6766 


20667 


A 


6820 


610 


248 


ERRSHPVA* AVVKWCHLGSLQPLPP \GS 
SDP P \ T * AS * VAGTTGVRHHARL I FVFF 
I ETVSHHVAQ P TGF * TPGAQAI RLPQ P P 
EMARITDVKPLCLAHFCYSKHFIMVPAP 
CISHQKVFH 


6767 


20668 


A 


6821 


3636 


3332 


TCLGSHATPHCGHS FS SCKI LLF * FFVF 
EM*SHPVAQAGVQCRDLRSIiPPPPL* FK 
RFSCLSLPSSWDSRRPPPRLANF\CIFS 
RDGILSCWPGWSRTPDLR 


6768 


20669 


A 


6822 


563 


146 


APGLSFNCYIPPAPAFMPGELTLiFEVD/ 

NRSIFPTEY/TIHILISSEDILHS*TIP 

LLGLKTDAIPRCLNQTTLTTGRPGLYYG 

QCSEICES/N/HSFIPIVLELILLKYFK 

T*STSTLKYHCKAIQHLPFKLKIDGVYT* 

SLQ 


6769 


20670 


A 


6823 


399 


201 


KAEPPPAKSWSKGGYVLPRS / IRLDHVK 
WALEPDD IAVLNFMKERHPQS KS * TILQ 
KWIQPFSPIVQK 


6770 


20671 


A 


6824 


323 


323 


LPTVIVI ILSFYNFFFFSMRLIRTLSFI 
CMYVCICIYVCIYVCIYVCVYIIYVCIY 
/IYINISF*NDRISFFFMLE*YVCVYIH 
EWIKKI 


6771 


20672 


A 


6825 


396 


122 


LPPGGHRPAVPGGREMGPSRGNPGNPGR 
F * KKGPPS PGGGPAPGGPP S PEG * ARKM 
G*PRGPSLP*TQIPPPGSPP/WGPPGVT 
PETLVGFKKRGHPPRGGGPPPGVPLPRR 
AKPEKWVNPGGQAFHEPKSPPRGPPRGA 
KPETLS PKKKKKV 


6772 


20673 


A 


6826 


414 


64 


RGVE GGAAC I F KGL S VAGKPG / YL S AMR 
SCFRRMLLFKS*G/QVIATKHMLPTVLR 
MVGD P VAWVRFNVAKS L QK I G P I IiDNS T 
WQSEVKP ILEKLTQAQDVDVKCFAQEAL 
TVLSIiA 


6773 


20674 


A 


6827 


2 


519 


KTKTCLVEEIS*ITGRFLERINKIDKPIi 
VNL * RKK \VQ VLN I RNE KVD VT TDRT * K 
\ I IKEHYGQLYANKFN / IDEMDKFLKRD 
KLAKLTQEE I ENLNI PVS I * KFL I *NI P 
KMKTP /GPDGI SREL 
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SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E^Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G— Glycine, H=Histidine, I=lsoleucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, Wpossible 
nucleotide insertion 


6774 


20675 


A 


6828 


413 


114 


GGKRGPGP I PHP PARGPQ* GPGHPAVQV 
PEPPGLSFPHFRGGFPSSGGCQRGEGRK 
ERRPSWTGPSWMAQEGRTPAARLEGAVL 
PG/APGPEAPGTGTTMSAPTWPEPP/G/ 
GRQVQDVPAFPEGSARQAR*HEE*LHWK 
EILL 


6775 


20676 


A 


6829 


479 


261 


PCFSFLSNEDCRHAPP/RPV* F/LVERG 
FRHVARAGLE FLT * SGLP AS AS QGAGIA 
GVSHCAWPLQVG I E AS LQY 


6776 


20677 


A 


6830 


492 


313 


HP I GWP VNWGRAS AVGE FRD WKLF YPE C 
E I RTMGGRE HGQ S P GAQRT F FQLLL S FX 
VESK 


6777 


20678 


A 


6831 


378 


1 


SNGICLQASSASPNKIVPSTSLKVCSN\ 
VRVDGSRNPSVAFPPLTPNLFAVAPS IS 
SGMGI ETI P I QGYRVDEKTKKCYIPF\ V 
*ANRHSPSGVYNINVHALVSGPPLSDDS 
GVNKPQMKHQHCSA 


6778 


20679 


A 


6832 


374 


76 


QIj VE KNTGNVKS MTDRVLTMRCELRARL 
*ALRTPGTWNLINDQIGMF\KFSGLSPK 
QVE CL VNEKH I YLMP S GR I NVS GLTAKT 
LGYVAAS IQEAVT I IQ 


6779 


20680 


A 


6833 


369 


56 


LKYLGTS VENRYADQAKWLS PVTSALW* 
FE/ CGWIARSEDRDHPG* QSE I PS/ PAN 
FLFLVEMGF YHVDQAGLDLMTS GNP PAC 
NTGMSHCTRPGRTDVLAILCSDP 


6780 


20681 


A 


6834 


457 


142 


DCLALSSRLDLQWCDLGSLQAPPPGFKQ 
FLCLSLPSSWDYRCLPPRPANF\*FLVE 
TGFHHVGQAGLKLL T S S DL PAS AS Q SAG 
ITGLSHRAWPDSSFEGSSSS 


6781 


20682 


A 


6835 


407 


35 


TAPAIRGVHLRGRGWTVYGAV*AVPPAS 
I PAYPG WYQ YGF YGADL YG \DMQHSDM 
RGLMVSQHS PMPHT\LQQS \ S VTVMAGC 
TQ P S HTMPL P L P L V Y \ ELAL WR VYAE VA 
TSYLPPAEVMLAPCK 


6782 


20683 


A 


6836 


1 


428 


GKTLNPPSLGKSFGLIGLPPENDWPPDV 
SVPRAAFPLRGPRSVQSWREMEQSVAQ 
LLL * MLTFTPHKR I SAFRAL \ QHSYLHK 
DEGDPE 


6783 


20684 


A 


6837 


3 


612 


FKGSGVFNRKKKLEEHPLWQDVALGTAL, 
CPQLPSLPGWGVIEWPRRAACLPCNPQL 
CPHCCGPQEAQYPL*AGGQ*GQHQEKPG 
A\GPHPPSGPQVPSAPGEAMTAMEGRKL 
SPAVAQNPPQRKNTLPRGPLPRGAPTTT 
P*SLRPR*TPVGAGLAAPIRSHHPGCSG 
QE AIiAL C PDR PHKDRGVRRGAGG PRGRR 
VQLGGIPP 


6784 


20685 


A 


6838 


3 


412 


YRWGFTMLTRLVLSS*PR/CDPPASASQ 
S VG I TG VNHR VRL FFFFFGTG PHS \MAQ 
AEGQGHNLG*LQPQPPGAKGTLPRTGNF 
KGGPHTPPNF/ C I FFGKNGVS P YCPGWD 
FGAKAILPPGPPKELELEGSPPAPGKVF 


6785 


20686 


A 


6839 


419 


26 


EETSRLWSIGQCPLLIQTAQVLSFTSGH 
GRHREHRKPTDSPLPSACPAP/RLTPPP 
SMCPASHSLPSSQCMPGPTLTPPTACAR 
PDTHSPPPSVCPAPLP/CRPVCPPPRSL 
PTPA*TKPRTQLPSSMYCGRSS 


6786 


20687 


A 


6840 


365 


66 


LLERQIGRSVYGPSSKSSIFDVPSLPHR 
GIKLVFGLiCHTSSWDTGVHYAPPA*SWH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W— Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














TVVHPRPRQVGQ/AGFELLTSCHPSPLT 
S QTAR I AGVSHRARP KFM 


6787 


20688 


A 


6841 


445 


188 


GFQ* FS CFGLLGGWDTRHVPPRIiADFC I 
F/M*RWGFAMSARLSSNS*SAPSASQSA 
G I TGMS HCARP ATLNS P I CGTS L PLGS F 
FF 


6788 


20689 


A 


6842 


3 


412 


PQQS PQLTTVTAPRPRALLMGTTMANLQ 
TR VS TS HLQGPRLTS ALT YQMAQ VAVAT 
P P WS VLAWVLS ARVTVLP /LN / VNGI 
SWIRQIiQKAAGRQPPSPHQTRNNSLE* 
VASNLSKFLILEVAELFCHVLTFFSYQ 


6789 


20690 


A 


6843 


113 


410 


NHKCNVNFYNGPRSLEKRLITLPILHYF 
USrPILCKSSYLTLLNLKLQTNLFCK/YE 
M/ESHFVTQAGVQWHDLGSLHPKQSSYL 
SLihSS* D YRGMP PHLAN 


6790 


20691 


A 


6844 


2 


357 


FCPLARIIjQSGRFAVIjPWNSGLVRGRRQ 

egphgaftreht/ slrapqprsenerr/ 
riacalraahqasgwrltpraprspsgc 

HCPPAEPTSR*EAWGARRKGHGCCPAPH 
QPAPQDIS 


6791 


20692 


A 


6845 


392 


279 


HFFFPCRD/KSLAMLTRLILNSWA*VIL 
LPQPSKVLGLQV 


6792 


20693 


A 


6846 


3 


275 


VNLVDKAAAGFERMDSNF \ ETS S TVGKM 
LSSSIARYREIFYERKSPSMQQTLFLSH 
FRKLP*PF*PLATTTLIY*QPSTSRQDP 
LPAKRAR 


6793 


20694 


A 


6847 


1 


454 


FFFFETQFHSLPRLECSGAISAHOTFCL 
LGS S S S PAS ASQ WG I TGMHHHAW 1 1 F/ 
VLiFLVETGFHHVGHAGFELLTTCGPPTL 
AS IMC I LVYLLVFQHLVSLKNLKGQNAS 
FYFSPFGQHXjSGFVKCSIWCCGSLSGLE 
GHFLPPHS PPLH 


6794 


20695 


A 


6848 


13 


148 


GSAWRHTPAG\ELLKKLRHRYGLNPEGG 
GCTEP I WCYRTPPWTTD 


6795 


20696 


A 


6850 


204 


30 


T I FSRQVLR I QNVLS E KLRVS T VYANNG 
S \VLQGTLWASVYHGKILIGTVFHKTLY 
CSL 


6796 


20697 


A 


6851 


2 


429 


EVWASGITGKAIMAMPIVIQPKNPSSYP 
CRRQFPLQLEAKEGFQLLTEKFQKHGLL 
IPYN/TPILPVKKSNEKYRLVQDLRIIN 
EAWPIHPMIPNSYVILPQPPDAQWFPV 
LDLKDSFFCISP /VDPSSQFLFAFE 


6797 


20698 


A 


6852 


3 


160 


SGWDHRRHTQLIFMYSFLFFCRDR/SLP 
MLPKLVLNS WAQVI LLLQPPKVLGL 


6798 


20699 


A 


6853 


2 


243 


ACMIIAAATIILRW\PLRYIILIWGIN 
KFTKKLRNPYSIDNNELLDFLSRVPSDV 
QKVQYAELKLCSSHSPLRKKRSAL 


6799 


20700 


A 


6854 


24 


458 


SRAARRAAGTTRRSSHTGCRPRPADPGA 
RCLPRGSPHPRMDPPESP/PSETSPGPP 
PMGPPPPSSKAPRSPPVGSGPASGVEPT 
S F P VE S E AVMED VXiR P L E QALED CRGHT 
RKQVCDDI SRRLALLQEQWAGGKLS IPC 
KEENGS 


6800 


20701 


A 


6855 


3 


183 


CCDPPVSASQRAGITSVTY/Q/WPSLGS 
LQPPSPEFKRFSCLSLLSSWDHRHPPLR 
PACKVFC 


6801 


20702 


A 


6856 


2 


242 


GNHP YARKLA\ AQRRAS TVSS VTQVEVD 
ENAYRCGSGMQMAKDSKSLKTHQTQPGI 
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SEQ IB 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleuciue, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y—Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RRARDKAL \ LVANGKRLWC I EGSVC 


6802 


20703 


A 


6857 


54 


401 


EGERLQKFMHVLKTHTHTH / AHTHERAH 
THTHTRTHTPRSGMHRSNNRNLAKNLFK 
IFFFLSSSPPSLFFLIIFPFLSLSLSSL 
SLSLYKHTPLCILSLRTEHLHRMGALLG 
GHILE 


6803 


20704 


A 


6858 


1374 


579 


CENPSRSTRPGQGPLLLPHQLPFH/RPA 
PSQSSPPEQPQSMEMRSVLRKAGSPRKA 
RRARLNPLVLLLDAALTGELEWQQAVK 
EMND P S Q PNEEG I TALHNAI CGANYS I V 
DFL I TAGANVNS PDSHGWTPLHCAAS CN 
DTVI CMAti VQHGAA I FATTLS DGATAFE 
KCD P YRE G YAD CAT YLAD VE Q S MGLMNS 
GAVYALWDYSAEFGDELSFREGESVTVL 
RRDGPEETDWWWAALHGQEGYVPRNYFG 
LFPRVKPQRSKV 


6804 


20705 


A 


6859 


36 


269 


IALGSMDVLTILMFNEHSKNEIKRITPL 
KIATR/ IKYIjGINLTXRGKKDLDTENYT 
TLIEEIEDDTNKWKDIPCSWTGV 


6805 


20706 


A 


6860 


47 


888 


TLRARALQARPRTGSSCTAATWTS/SGA 
SQHSLRALSWRRLYLSRAKLKASSRTSA 
LLSGFAMVAMVEVQLESDHEYPPGLLVA 
FS AC T T VX> VAVHL FALiMVS T CLL PH I E A 
VSNIHNLNS VHQS PHQRLHRYVEIAWGF 
S TALGTFLFLAE WL VGW VKF VP I GAPL 
DTPTPMVPTSRVPGTLAPVATSLSPASN 
LPRS SASAAPSQAEPACPPRQACGGGGA 
HGPGWQ AAMAS TAI MVPVGL VF VAFALH 
FYRSLVAHKTDRYKQELEELNRLQGELQ 
'AV 


6806 


20707 


A 


6861 


3 


391 


NKISFFCRDWGLPMLPRLVLNSWAQVIL 
/LISTCQPPKVLGLQA 


6807 


20708 


A 


6862 


322 


175 


GCRCVPPPRASFKIFGRV/RGLDILPKL 
VSNSWPQAILPPWPLESLVLQA 


6808 


20709 


A 


6863 


700 


294 


YVLRQGLTLLPRL/ECSSAI IAHCSLKL 
LGLRNP PTS ASRVAGT I GACHHTRVI F I 
VF WI E S CDLAQ AG P KLLGS S D P PALAS 
QSAGIAGISHHAWPPYYFYWVLSSTTPS 
VLSLHSHIi I LAT I LRGGDVTS QTDK 


6809 


20710 


A 


6864 


371 


77 


RSMFAN\NLVYDTSDSDDYHLLKVLEEG 
IQTLMGRLEVGSRRTGQILKQTYSKFDT 
NS IjNHVALLKNYGLIjYCFRKDMDKVETF 
LRMVQCRS VEGS CG F 


6810 


20711 


A 


6865 


273 


34 


DYLPTYLKLFRLLFF/ CLI I ETESHYVA 
QAILELLSSSNSPTSASRSAGITGISHH 
TQPNFSNNTLLKPYLRLSYALLFA 


6811 


20712 


A 


6866 


1 


190 


GAKGMHHYTWLVRYF/ CCVEMRSHYVTW 
VGLE/PPALKHSSCLSLPKCRDYRHEPP 
HIiALFCFSF 


6812 


20713 


A 


6867 


506 


293 


EKPSNGHKKPYTWISIAILFI IAKTWKQ 
S\PRCPSAGEWINC/WYTQTMEYYSMLK 
RNELSNCEKT 


6813 


20714 


A 


6868 


2 


116 


LFPFCRDR/SLTMLPRLISNSRAQGILL 
PWPPKLLGLQT 


6814 


20715 


A 


6869 


10 


141 


GSAWLFFIFCR/DRGLALLPQLVSNPWL 
QAILLLWPPRVLQLQA 


6815 


20716 | 


A 


6870 


2 


603 


APTFINFPAKGKPKRGDTYHLQVRGFSA 
EQIARWIADRTDVNIRVIRPPNYAGPLM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to First 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I— Isoleucine, 
KHLysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty roslne, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














LGLLLAVIGGLVYLRRSNMEFIjFNKTGW 
AFAAL C F \ VLAMT S GQMWDH I RG \ P P YG 
P KD P HKGHVN Y I HGS S QAQ F VAETH I VL 
LFNGGVTLGMVLLCEAATSDMDIGKRKI 
MCVAGIGLyVLFFS WMLS I FRSKYHGYP 
YSFLMS 


6816 


20717 


A 


6871 


375 


1 


GF PGF PVPKFG I PAPVAPQNP FP PP S F / 
PQGPPPSPGGPNPNGKNPFFPKTPFFLK 
NSPQKKKIFLKKFKRGPP\GPNQKPPFK 
DFKKKRGLKPPKKGGPLKKKKKKKKRSN 
YSLKKIYSPFGFHFF 


6817 


20718 


A 


6872 


255 


23 


GRVD PTSTS QNAG I TGVSHR / GVQWRDL 
GSL/QPIiSPGFKRFSCLSLSSSRDYRLV 
/PFCIFSRDRILPFWPGWSQTPHLK 


6818 


20719 


A 


6873 


5 


157 


YLLFTSYTHAHTHTHTHTHTQFLLLWTS 
GLS \ S CC Y I FS LSGPNTKCNM I F 


6819 


20720 


A 


6874 


351 


132 


SPRRCSAYGVA/ IYLFLYFLINLLSLYS 
I F YG FAPNS FLCE VQE PS LG \ S GWEPLS 
GNHFS S IKMKI AHREVNM 


6820 


20721 


A 


6875 


99 


337 


AGSFFQKKGKKKISRVNRQPIQL.EKIFM 
NCAS D KGL VPR I YKELE S AR \ KYQS IPS 
KVG 


6821 


20722 


A 


6876 


69 


306 


YVSHHLSFFF/ CFFGGGEKALFFCPRTG 
IKWPQFGLLEPSPSGLKQFFGLNPPETL 
EYRVFFPPPGKFFFFFFFFFFFFF 


6822 


20723 


A 


6877 


335 


169 


WHDLGLIjQPPPPG\SSDSCASASQVAGr 
TGMHHS FSNF I YLFCYFE KVFLTFTLD 


6823 


20724 


A 


6878 


2 


237 


ELGYKVLPHPP YS PDLS PTDYHFLKHLN 
NFL \ FHNQQCAENAFQVF I ESQSTDLYA 
TGINKLI SHWQKCVDSNCSYLN 


6824 


20725 


A 


6879 


2 


199 


RGRVGRQ FQ KE I EE LKKKL E E GEE / VAP 
SSSSSSSDPLISEPDISGSEEDDDEEGE 
VGE D GE KKKKK 


6825 


20726 


A 


6880 


1 


206 


FFFFETBSCYVAQAGLELLGLGDPPASA 
SQWGTTGARHHAQLI F/VFVFLVEMGF 
HHVSQDGFDLLTS 


6826 


20727 


A 


6881 


1 


123 


WYIHTMEYYSABKRRKFLSFATTWMNLE 
\E IMLS KISRRRG 


6827 


20728 


A 


6882 


15 


105 


SQLL/WRLRQKNRLNRGGGGCSFPLWCH 
CTL 


6828 


20729 


A 


6883 


223 


1 


AASTFLFPNLKNSLRG\SLRTFSSVTNV 
RKTALTOLNSQDIQFFFSRRSLTLVARL 
ECWRSWPAASRRLGCL 


6829 


20730 


A 


6884 


193 


3 


QGEQDFLPPCLPNFFNF \ FFCRDTVSML 
PRLISNSWPQAVILPEPHKVLELQARAT 
TPSHVFIF 


6830 


20731 


A 


6885 


2 


355 


RLLTSSDLPASASQSAGVTGMSHRARPR 
NF/CIIjPNLSPCNTHSPLSLPQPLAPTI 
LLSLWICL/SLDIS/WDYRHMPPRLAKV 
FFFGDGVSQ\YRPGWSAWPFAASTSGV 

Cjtt JL Jj ir rr y f fJS. 


6831 


20732 


A 


6886 


1 


151 


HSVAQAGVQWWDHGSLiRPGTPGLKQ/ FL 
P PLKQS S CLSLLS S WDHRCATL Y 


6832 


20733 


A 


6887 


314 


203 


FLFFCGDKVLLCP \RDVSNS WTQVILQP 
WPPKVLGLQE 


6833 


20734 


A 


6888 


334 


86 


QNRS T F KRGM / MGWAW WLMF V I PATW E A 
EVAAKIAPIiHYSLGDRARLSKGGMKCTS 
IFCSKSAKDCQTIiPVKKKISLPTLPSI 
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SEQ IB 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine ? 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


6834 


20735 


A 


6889 


345 


152 


QHARIMSVFIVETGFHHIGQSGLELMT / 
S / GNMHAS AS Q S AR I TG VS PHARP S L P Y 
YKCS KKQLCNE 


6835 


20736 


A 


6890 


2 


169 


ARECSGAITAHCSLDLMGSSDTFTSASQ 
VAGTVGVT IMPRYI FK\ FFVEMESHC I A 


6836 


20737 


A 


6891 


361 


184 


NQSSCFSLLSSWEHRLMPPHLANF/SFF 
DRDE / SLTVLSRLVSNS WTQAI LPPRI F 
FENS 


6837 


20738 


A 


6892 


3 


330 


HESAAITGVSHCTQTFFFF/ CETGPRFV 
PQVKGQGHDLSSGEFGPPRLRNWPGLTL 
QNAGNTGTLP PHRTNFGD FRRGS FS P CG 
PGGFGTRDLGGFPHIiTGKRDTGLDPP 


6838 


20739 


A 


6893 


2 


348 


ARAFFFWNFFCK/NRGLSMLPRIALNSG 
AQGILPPWPPKKLGLRALPTWPKTGSSG 
LEGHSSLYPHHSLIFPHHCSKFCFIKQW 
RL F I LR I TENQGLT PLNAAKP AE P VMNE 
ARTQA 


6839 


20740 


A 


6894 


340 


240 


CRD/RSLRMLLTLVLNSWIQAILLSWPP 
KVLGLQA 


6840 


20741 


A 


6895 


155 


1 


HLCF WAG \ P VAHS CHPGTLGGRGGR I TR 
GQEFETSLLGPGTQDVLAKCSRA 


6841 


20742 


A 


6896 


454 


294 


PSSWDYRHA/PPMHNVVPDFFCRDE/SL 
PMLPRLVWNFWAQLilLLPWTPKALQLQA 


6842 


20743 


A 


6897 


49 


333 


I Y I CLS FYQS 1 1 YLYDKQHDVLETVSHS 
VSLDLPGLRFSHLS LWDHRL. ILPWLAS S 
NNFF\ CRNRGLLVLPRLVSNSWVQAILP 
PWLPIVLGLPA 


6843 


20744 


A 


6898 


434 


324 


FFFFCRD/RSFAVLiPRLVLHS WTQAI RP 
P W PL KVLGLQA 


6844 


20745 


A 


6899 


127 


365 


LPERRRYVS I SGYRYHLPQS LLFFYAQH 
ATFNNELW\LGVIAHACNPSTLGGRDGR 
MALGQEFEIGLDS I VRLHL CNK I K 


6845 


20746 


A 


6900 


208 


1 


FFFFFWILVETRFHHVA/RAGLELLSAG 
NPPTSPSQSARITGIWFFVWFSSNLSQF 
CYAHSFFTCWYSRA 


6846 


20747 


A 


6901 


1 


162 


GTSGTRLFTIGKRWQQPKCPPMAELINK 
MW\YTMEHYSVLKRKEIFCTNLTQLL 


6847 


20748 


A 


6902 


120 


313 


DRVL/WRLRQENPLNPGGGGCSEPRSHH 
CTP AWATRAKL YL KKKKNFKGTL S NW YS 
FRRVANTNLQMQSGSNYMQ I 


6848 


20749 


A 


6903 


340 


234 


I FFCRER \ S ILPRLVSTTWPQAI LLPRP 
PKVLGLQV 


6849 


20750 


A 


6904 


1 


327 


GTSGTSRILPWGPGCIRTPELKQSAHLS 
LPKCWDCWNYRHEPPVYSFAMLI IKLFF 
ELHLSVILWFLLIxANPQTQTESKTFSKT 
/ PRLVLNS WPQAVL PWSLKALAS QA 


6850 


20751 


A 


6905 


532 


359 


RDG/DLVWWPRMVSMPGFKQSSHL.GHPK 
CWDYRCEPLCQDFS YS I IT WFP PSREI> 
RR 


6851 


20752 


A 


6906 


270 


3 


GGTTVYPRKTKT / W PGAVAHACNPNI LG 
GRGGW I TRS GL P VS TKNTK I S RAW C CAA 
WLAT P E AEQENRLKP GDGDRDHHGQHG 
ETPSLSG 


6852 


20753 


A 


6907 


1 


312 


GTSWKKMLPTLMDDFRG\KTLVEELIPD 
WEIAREVELE\MEHEDVTELLQPHDKT 
WTDEELLLTDEQRWFLEVGSVPVEDAV 
NIVEMKTKDLEYYISLLIKY 


6853 


20754 


A 


6908 


331 


61 


FRHVGQAG/ LQKLLTSSDLPAPASQSAG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G1utamic Acid, F-Phenylalanine, 
G=G ly ci ne, HNHistid in e, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R— Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y==Tyrosine, 
X=Un known, ,v — Stop codon, /=possible 
nucleotide deletion, Wpossible 
nucleotide insertion 














ITGMSHHIRPKLILRKKSMRAKLDSRQT 
RLQPP WEQGLLEEQHEVYNVYMAHVS K 
SSSRKLS 


6854 


20755 


A 


6909 


419 


199 


S AVQDGV \ S WCDLGS L \ Q P P L S RLiKQ F P 
CLSFPSTWDYKCAPPHPATFCIFIRDGV 
SPCWSDWSQTPGLK 


6855 


20756 


A 


6910 


341 


69 


EAEVGGLLKARSLRPAWAIQQDPISILK 
KSFTLGGQGGWI TRSGDRDLRASASQSA 
GT TGVS HRAQ \ L I F AF LVE RR FHHAGQG 
GDDLLTL 


6856 


20757 


A 


6911 


174 


356 


KT YLMPLRS S LRPGMVAHACNPS TLGGQ 
GGWITRSG/ESRPSPVLVNRLTRCQANT 
VKPCLY 


6857 


20758 


A 


6912 


391 


3 


SPPPPPFPPSFPPFPPQNFFFPPGASSF 
FGPPPP/YFPPPPKKSPPPNPPPPFFFP 
PPLGKQFFSPPPPFFFPPPGFPFPPPPF 
FFFFFFFFFFFFFFFFFFFFCDKKYFIC 
LMICMLFFCKYILNFCRYS 


6858 


20759 


A 


6913 


43 


331 


IRDTYS FS LS APALAPMLGTKG VLLAVA 
SLGSRGASLCVFVCVCLCVRIH\ IGVQA 
SGCVCVCAC/CVCVCVRVCVCVCPHWPA 
SLLTKDLQCLPFTV 


6859 


20760 


A 


6914 


250 


11 


TPNLWGIaFLVFCFEM/ EVHYIAQAGLE / 
PPGLKPSSSLSLLSSWDYRHVMPHAAQI 
C I S VFS LAQKGVP P SNW I L I TH YS I KS 


6860 


20761 


A 


6915 


532 


53 


SQMALHEGFLLLFLKEKELHGKFSHPPP 
S APNS S MS KWF YFS ETESHS VAQ AGVQ 
WCNLSSLQPPPPGFKQFSRLSLTSSWDY 
TYLPPHLANFLFLVEMGFLHVGQAGLKL 
PTSGDLPALGSQSVGITGVSHHTR\PRF 
LFKIGPLFLFSKSMSQDEARI 


6861 


20762 


A 


6916 


2 


373 


WHTPWLATHEAEAGGSLEPRRAKLQLA 
MIVA/NCTPAWVT 


6862 


20763 


A 


6917 


1 


270 


GTRQSPRLSLLSSSCDHRRVSPHPADF/ 
SFFYFYFFCR/DRGFTMLPRQVLIWAQV 
MLLSQPPKLLGLQWGVSHGARPPHCLG 
YSWVYNKP 


6863 


20764 


A 


6918 


2 


367 


RCAPHCPDNS S FFK/ RVKTGLT I FPRQT 
SNSWPQVILLPWPP/KVLGLQA 


6864 


20765 


A 


6919 


3 


354 


HEVNIVETTAKDLEYS INIiVDKTGFEKI 
DSSFKRRSTVSKMPSDSITCYREIFCER 
KSQSMLQTSLLSYFKKLPQPPQPSA/TT 
TVI S QQPSTLRQGP P PAKRLRCAKGLND 
FQHFWL 


6865 


20766 


A 


6920 


366 


41 


S TS GEDAVH I VEMTT KGLE P / YTAVPGF 
HPSSERRSWGNMLSNSVTCYRGIFGER 
KSQC/RQTSLLPYFN/KPQPPQPSATIT 
Ll\SQQPSTSRRDPLPPKRLQLTESSND 
H 


6866 


20767 


A 


6921 


3 


368 


QLLMGLHLEPATLDLLQLKT\TIQGGSH 
SHHITAIP\PS\FTRENTLMFIHLSPIL 
LLSLNPDI ITGFFLLKKKJSTKPKNHKNRG 
PSYKRTPEGPLLLLWAQKTLPPKGGLY 
KIAPLRLFGEDS 


6867 


20768 


A 


6922 


3 


91 


FSVLPRLVXQNSWPQVICLPWHPKVLGL 
QV 


6868 


20769 


A 


6923 


334 


91 


ERCGKCAHXSEIFLSSI,DYIYIFVFY\F 
FFLFFFFFFFFFFFFFFFFFFFFFFFFS 
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SEQ IB 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 

> 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!u tannic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L-Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
Q=Glutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJnknown, *=Stop codon, /=possibIe 
nucleotide deletion, V=possible 
nucleotide insertion 














QFKFNTKS KEKMFMKTMKKVQVTF 


6869 


20770 


A 


6924 


375 


37 


TKAGVQWHDHGSLQPQPPRLKPSSCFSL 
PSSWDYRHAPP/RP/ALFLFLFGLVWFG 
LVFLVE I GS C Y I AQAGLKLLGS SDP PTL 
VSQSARIIGPTHHAWPTLTTSIQCTIRG 
PG 


6870 


20771 


A 


6925 


647 


1054 


CFWKAHIQAGETNNIQEGSERFHFCPAD 
EVYVHSGPAVEDQPQRRS FAL / SAQAGV 
QWCNLSLLQPLPPRFKQFSCL\SLPSSW 
D\HRHVPACLAKFCI FSRDEVCHVGQAR 
LELLTSGDP PAS AS Q SAG I TGMNHHV 


6871 


20772 


A 


6926 


3 


204 


DAWETTGFGMI YDS LDYAKKNE PKHR \ L 
ARHGLYEKKKTSRKQRKERKNRMKKVRG 
TAKANVGAGKKK 


6872 


20773 


A 


6927 


1 


112 


PTRPRTRGVASVLYFTTILILIPT\ISL 
IENKIIiKWA 


6873 


20774 


A 


6928 


1811 


478 


DRARSPRSERA\RRSSSRIiRQRDPSLGR 

RRLRREIRPGLPESEPRPPRPPAALTAD 

QPPPRRLSESRGGGGMSE\AGE\ATTTT 

TTTLPQAPTEAAAAAPQDPAPKSPVGSG 

APQAAAPAPAAHVAGNPGGDAAPAATGT 

AAAASliATAAGS \EDAEKKVLATKVLGT 

VKWFNVRNGYGF INRNDTKEDVFVHQTA 

IKKNNPRKYLRSVGDGETVEFDWEGEK ■ 

GAEAANVTGPDGVPVEGSRYAADRRRYR 

RGY YGRRRGP PRN YAGE EEEEGSGSSEG 

FDPPATDRQFSGARNQLRRPQYRPQYRQ 

RRFPPYHVGQTFDRRSRVLPHPNRIQAG 

E IGEMKDGVPEGAQLQGPVHRNPTYRPR 

YRSRGPPRPRPAPAVGEAEDKENQQATS 

GPNQPSVRRGYRRPYNYRRRPRPPNAPS 

QDGKEAKAGEAPTENPAPPTQQSSAE 


6874 


20775 


A 


6929 


324 


203 


L I FVFLVETGFHHVGQDGLDLL / NLVI R 
PPRPPKVLKLENP 


6875 


20776 


A 


6930 


3 


364 


HEASLTQTRTTMTHCSRTTGCSTASGRT 
WTRSRHSCAWCSAALWRAAVASRCPSS / 
IPVTPPQCLSWP/WKVPLQCPPALS 


6876 


20777 


A 


6931 


2 


165 


GRVGFCS VAQAGLQWHDLSSLKPLPP \ G 
SSDCPASASRIVGITSLAFHFLNNVF 


6877 


20778 


A 


6932 


1 


370 


TELRSYSPSNGAISSQCNLCFLGSGYS / 
RSSTSQVAGITGA/R/HHTQLIFVFLVE 
TEFHHV\AKAGLELLTSGDPPASASQSV 
G I AGVS H CAWPHVY I LKE IMLGNLVRCL 
MSICHHHTESICLFL 


6878 


20779 


A 


6933 


1 


441 


RRANTPNNPEAPPN/ QKKKKKKKKKKKG 
KNPPGGPPPFFKTPPPFKNGGGSPPGKK 
KGGE KKRGGKI FFPP PGGKKP P PQKKKG 
GGGGKKIFFFLFGPGGGGNFFKKKKGPP 
PFPPRKKKKIFPTGGGPKKKKKKKAPPK 
KPRGFYFP 


6879 


20780 


A 


6934 


518 


340 


PKPKNFPS/PSPPKIPPPKKKVFSKKPP 
GGFNNPPHKRKKYNFPPPEKLGPPKEFL 
KRPPL 


6880 


20781 


A 


6935 


396 


299 


WEVC/GVCVCVCVCVCVCIHSLCKSTL 
HRTGV 


6881 


20782 


A 


6936 


486 


272 


PTRAPTRPAPPHLAD FL I FCSDR/ S LIM 
LPRLVLNSQPQMILPLWHPKWGLQA 


6882 


20783 


A 


6937 


1 


264 


YSNLRDRARSCLSLPSSWDYTCSWDYKC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 1 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L— Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














ASLRLANF/ SNFCRERGLAMLLRLVSNS 
WAQVT LLPQPSKVLRLCLAVSVFFGEKQ 
GQCS 


6883 


20784 


A 


6938 


2 


407 


TTGTCRSVWLTFVFLWTGFHHVGQAGL 
ELL/RPPQPPKVLGLQA 


6884 


20785 


A 


6939 


422 


65 


LKGPPPSFFFFFFFFFFFFFFFFSP \LP 
YILYFDCASNACLFFKNNTWRYIKRIFF 
SPTPFIPFFHEVENVTSLSYMVFQYCFL 
FF 


6885 


20786 


A 


6940 


462 


209 


LSEFWDYRREPPLPAKFAIFMLCRDRGL 
DTL P S LVS S / S WHQAILAPQ P PIMLGLQ 
VHSAYKGFCLPVLTFEEVIGNRPLGLLP 
P 


6886 


20787 


A 


6941 


491 


299 


CAEYS PESGPTHASAHAS / SNVQMVYSR 
ISC3CEELLLGRTS PSKNYNMMTVSG 


6887 


20788 


A 


6942 


429 


256 


FSHLSLPSSWDYKHLSSCPANFCIFVQT 
GFHHVGQA\GDPPASASQSVGVTGMSHH 
S 


6888 


20789 


A 


6943 


932 


661 


FKTGS YS V\TRLEGSGE I SAHCNLRFLS 
SSDFFHLSSQVARITGSSQYARLIFVFF 
VEMGFAMLPRVGLKFLGFKPLHPP\RTP 
RVLGLQE 


6889 


20790 


A 


6944 


381 


55 


PAS LPPCSLI SD CCASNQRDS VGVGP S E 
PGVGYSIaWRRFLSRSEKRNIRVGVTRF 
SRCV/LS PLSLTQKGNSLTPCASQVRQC 
LALLRLAHGACTHWPAPTVWHSIjVR 


6890 


20791 


A 


6945 


816 


513 


FFIFIYFFEMESLLPRLECSRMISAHCK 
VCLPRSSSY/PASASQVAKITGA/TRHS 
RL I FC VC VL VERRFHHLS QAGLELL / NL 
VIHPSRPPKVLRLQGVTRTA 


6891 


20792 


A 


6946 


383 


234 


LIPSLMDNSEGLKTSVEEVTADWKIVR 
ELELKVKPE/NVTDLLQS\YDKT 


6892 


20793 


A 


6947 


423 


60 


LNPPPPPF\YKPPPKKKKFFSPPPLKFG 
PPTKFFKRPPPFFFFFFFFFFFFWHGTF 
PLNSLVGPPKVEGWVSWAQLPRAVGPPIi 
PNP KWGLNKP PAGGLNKP IHFPSLNGSG 
EATPPLIPWC 


6893 


20794 


A 


6948 


1 


327 


MRLLAAALLLLLLAL YTARVDGS KCKC \ 
S RKE P KI R YSD VKKL \ EMKP K \ YPHCE E 
KMVI ITTKSVSRYRGQEHCLHPKLQSTK 
RFI KW YNAWNE KRRVYEE AQGLRRR IG 


6894 


20795 


A 


6949 


952 


796 


FFLHFLVEMG\ FLLVGQAGLELPTSGDP 
PALAS QSAGI TGMSHRARP S VS S 


6895 


20796 


A 


6950 


364 


6 


I TTHKNYFLSRTATPPKEGLPQRKKI FL 
FFTP I FI FFVGFI FWSGGGRGFLVFFFV 
FFFFFFFFFFFFFFFFFFFFFFFF\RVK 
TFIGFKVLFFKINVFFFLIRAVPLDYMN 
HALTIFQW 


6896 


20797 


A 


6951 


418 


266 


ISFFFXXXXXFFFXXXFFLLFFFPFFFF 
FFFFFFFFFFFFFFFFFFFFFFN 


6897 


20798 


A 


6952 


459 


7 


PSYLYKPGPSQPNRRRHSVP/HIRSNKD 
PPPPEVMKKWGPPTPQKKKYFPPRKKKL 
GGGGGPIYPPKKKVFFKKKPGGFKKPPK 
KKKKIFFSPPEKRGPPPSILKNPPHPFI 
FFFFFFFFFFFFFFFFFFFFFFLIGEKA 
MLLDMGA 


6898 


20799 


A 


6953 


325 


143 


L P S S WD YRRVP PHS AHF KF F CRN / RGLA 
MLPRLAKPRFPS IS IFLNHPQVSEILGA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ II> 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cy stein e, D=Aspartic Acid, 
E=G!utamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=lsoleucine, 
KHLysine, Lr=Leucine, M~Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X^Unknown, ~=Stop codon, /—possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














TRLAR 


6899 


20800 


A 


6954 


585 


306 


KTESHSAIRSQCS\GEISAHCNLRLPGS 
SNS P AS ASRVAG I KGTHHQ VQL I FVFLV 
EMGFHRAGQDGLRSPALVICPPWPPKVL 
GLEAWPALHC 


6900 


20801 


A 


6955 


2 


155 


FLVETGFHHAGQAGLELL I SGDP PTLAK 
S\DSQDVRITGLSHRAWPFLDIY 


6901 


20802 


A 


6956 


3 


346 


DAWPDAWG K \ RNWP PHQANF YTFVE / TG 
FT/ HVGQAGL E L LGL S D VP T S S SLDAG I 
TGMSHCSRPRPEISSRDVHTIPGALQET 
WHTPDLASS IRSLRCRLDCPHNHCPLDK 
DGVTL 


6902 


20803 


A 


6957 


I 


224 


Q CGR I R SRKDAR KGNVQ 1 1 PRE GRRR TS 
PRYLFPVTGQETDGWG\MGLPAWLLKTR 
STSPPRSPRLHRRRGATSW 


6903 


20804 


A 


6958 


102 


344 


GLLKPKSLKLQCTIITPVN\mCTAAWA 
T 


6904 


20805 


A 


6959 


3 


296 


KEEEEGEGEGGGGGG\EEEEEEEEEEEE 
EEEEEEEEEEHAEFTGHVESITQCKLLL 
CHLS LRVEAGI THLLCFTVRAVTRS QRL 
LTKRRNGSVGHTSE 


6905 


20806 


A 


6960 


3 


172 


RLGL P KCWD YKRE PP / RPGLE LLTSS DL 
PALVS QSDGI KD VGHDTE PQD F YTLGQD 
P 


6906 


20807 


A 


6961 


2 


224 


LALLPRLARSGTI IAHCSL\NRSLRLGL 
SDPPASASESTGTTGMSHCSQPAKLLST 
CIRNSVAETKYYLMFKCK 


6907 


20808 


A 


6962 


2 


330 


KGTLRRQVT YKGTL IR \ h VADTLAET FH 
ARKE WDD I F KVL KKHCQ P R I L YLAKP S F 
KNEGES FSQRKC I TTRVARS KMAKGVLH 
LEVKDISTIIKQMKIRSSLVEQIHKW 


6908 


20809 


A 


6963 


1242 


929 


ETGSCSWQAWQWCGHGYLKPQPSHAH 
/DDPPTASS I AGTTEAH YHAQL I FKKFF 
NSLCCPGWSQVSWRILGSSDPPASASQN 
AGITGMSHGAWPQLS IYRKGN 


6909 


20810 


A 


6964 


1 


378 


KFFLQLFLHSFIYEHFLFFSFLFFFFFL 
RPS FAFVAQAGVQWRDLGS PQPLPPRFK 
QFSCLSLLSRWDYRHAPPQSANF\EFLV 
ETGFLHVGQAGLELPTSGDPPTSASQSA 
GITGVSHRTRLAFY 


6910 


20811 


A 


6965 


393 


43 


VSPFPLKNFYFSFTPKTFVGGGGPSGPP 
PQKRFFFHTPNSPFFPFPPF/YKSGPRP 
GFFFTPPPEKGKNFP\PPLRLGPPPVFF 
TRPPPPFFFFFFFERQRTCSFLHFLWR 
PLFRFYL 


6911 


20812 


A 


6966 


225 


2 


TRVE CHGVFFGS CNPLLPGLKE FSAPAP 
RGSGNSGPFPPPR/RNSFFFFFSWLLE 
MGFHHGQADLE ILTSSDPP 


6912 


20813 


A 


6967 


8 


341 


FVSVPPILHYLFKIFFFFLRRSLCLPPR 
LECRGTISAHGSLGLPGSSDSPS/STKN 
TKLPLRGGTCLGPQLLGLLRHEKTWTLG 
GGSGSYPKSGKRAMALQGGVQNKTCVPH 


6913 


20814 


A 


6968 


29 


362 


DYTCKHPHSSELKKKKKKKKKKKKKKKK 
KKKKGGPPKKKP/HGGPQFPPGRQKKIF 
P FKGG FKKPPRGF F E KNP F FGGGNLGP P 
PPPKNKPPGEKKNFLGGRGQKNSFLSAW 


6914 


20815 


A 


6969 


391 


17 


VFFFFFLKKFFPPPDKKLFFPPFPLKIF 
F/SPPUTPFFFLGGFSQISPPPKKGFFPK 
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SEQ ID 
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nucleotide 
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sequence 
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09/515,1 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P-TroVme, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IPPGV/YFSPPFKKKNFPFPPPLNFGPP 
RVFFKGPPPFFFFFFFFFFFFFFFFLGL 
RGNAGDCNG YGD VS YK 


6915 


20816 


A 


6970 


374 


268 


IMLPRLVLNSWPQVILLPWP\PKVFRLQ 
VRAPVPG 


6916 


20817 


A 


6971 


614 


315 


FFFGDALSPGLE CGGAVLARLQA / CLLG 
SHRSPASASRVAGTCKRPPPRPSGFFVF 
LVEAGLHRVGQDGLNLL/NLVIRPSRPP 
I VLGLQGMSHHARLF YFL 


6917 


20818 


A 


6972 


861 


528 


FETESRSVTQAWQWCNLSSLQP/LPPG 
FKRFFCLSLPSSWDYRRTPPCPANFCIF 
SRDGVLPWWPGWSRTSDLVIRLPRPPQC 
WDYKREPLRLAKSRHPWNHLPEQEVEHF 
K 


6918 


20819 


A 


6973 


274 


119 


P PN \ FVFL VETGFLQGGVKLSTPGDP P P 
PASRRAGITGVSHPACPGKEYIFKN 


6919 


20820 


.A 


6974 


370 


194 


KITNFAKIiFGYILPYSHYFWCLEKNRNR 
/SLTLLPRLVSNSWAQAFLLLWPPKVLR 
LQA 


6920 


20821 


A 


6975 


2 


239 


ARGPCS S I / SDCCASGEQGS VGVGPAE P 
G/MGYNLLACCLLRPLEKCS IKAGVSRF 
SWYSLSRLPLT\RKGNPPTPCISQVR 


6921 


20822 


A 


6976 


361 


270 


PQWLVPVIPA/LLETEVGGSLEPRSSRP 
AWAR 


6922 


20823 


A 


6977 


363 


48 


GPPKEMCPYPN\PRPYECNLFGKRVFAD 
VIKFKI LRW\ I I LDYTQVDPKSNDECPY 
KKQKRRGHRQ I KRKS QADQGGRDWSGAT 
ARNAKM I TSVGEDVRKRKP LAYC 


6923 


20824 


A 


6978 


9 


163 


AGYNFLVCC\LPRLIiEKCS IRMGVSRFS 
RYHLSWLPFARKGNSPTSCTSWVR 


6924 


20825 


A 


6979 


408 


212 


IETGFCHVAQDG/LELLASDDPPASASQ 
SAEI TDVSHRAQPHSAFLP WFS SKI FAY 
GSVWLVFPHS 


6925 


20826 


A 


6980 


395 


254 


LIFVFLVETRFHRVGQAGLELLTSSDLP 
SIiASG\ ITGVSHHSRPSSAF 


6926 


20827 


A 


6981 


408 


2 


KVSFFFPKGPRGKKKNGAGGFSPFFPPF 
LGGGTKNFFYPPIFSQGGGKK/ QKPPPP 
EKKKKFPPNPAFWGP PLVLKNWGGRPGGP 
LDPKVSNFMGAPPFLFGAKKKTFFSPPP 
PKKKKPQKKKGGGRSRSRTSRTRG 


6927 


20828 


A 


6982 


2 


302 . 


RKHLPPH \ P VI FVFLEQATLRHVGQAGL 
EIiLTS SDHRAS AS QS AGI TGVSHCS hPA. 
TSFSTP P FS VAKMLRGTQKMKVPL I GPL 
PQP IRLVAGHYF I Y I T 


6928 


20829 


A 


6983 


2 


375 


RGRVGG P VGG P VGRT F I F FQAG S D S AR I 
LVLFPYCTLPHPLPGLFFFFFKTGIKSW 
VLLNPSPRFVIFLEKKIFLPQPPFF/HI 
LSPPHPGGAGGGNPTARKIKGLSYRDPK 
EGHKRGWAHTPTP 


6929 


20830 


A 


6984 


1 


146 


PRPANFC/DFLVETRFRHAGQAGLKLST 
S S DLiP T Jj V b Q S AGTTGiyi S HRA 


6930 


20831 


A 


6985 


2 


95 


LTMLPRLECNDTIPAHCSLNS/SGSSHS 
LTSTSKVAGTTGVHHHFWLMF/PLFI IQ 


6931 


20832 


A 


6986 


340 


169 


VCS SGLS S PLLEQHKTNLIFYASGDI CT 
/ANGKSGFNQPQPFLKTFCCTHRIMSCT 
YIi 


6932 


20833 


A 


6987 


346 


218 


PCLTNF/ S I FCRDE / SLTML/ PRLVLNS 
WAQAILLPRPPRLLRLQA 
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SEQH> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny1alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 


6933 


20834 


A 


6988 


345 


141 


NRPKCPSMIEWIKKMWHIDTMEYYAAIK 
KDKFMSF\ETITLGEVTQEWKTKHRLLS 
LIRGS 


6934 


20835 


A 


6989 


154 


236 


ELISFSRSVERCISSFIKVLFVETDLFT 
LSLSPFFSVLFFFC/RRWGFTMLPTLVL 
NSWPQVILLPWLPKVLRLPT 


6935 


20836 


A 


6990 


2 


156 


FLVE / TGFVHVGQVGLELLTSGDLPASA 
SQSVGIIGVSHRVRPCFLFVCICLV 


6936 


20837 


A 


6991 


310 


96 


KN F F F FL VKKS P / LNVAQGGL KLLG S ND 
PPASVSQSGGITGVSPRARPLNFYKEAT 
DPNTSVLKGLNIKKKKN 


6937 


20838 


A 


6992 


15 


279 


NLLCFPSISPLIGIFFFFERGFHHVGQA 
GLKLLTSGD /LRAS PSQRARI TAMTRHP 
HPLVCGLLNSTHCDWRGRI PHWGFDLRF 
REG PS 


6938 


20839 


A 


6993 


127 


352 


KLADTPQSRFVLVCV/LIETKYSIiCCPG 
GLQLLASSNPPTSVSQSATITDVNHHAE 
SQVFLMLWPKSSKPTNTGL 


6939 


20840 


A 


6994 


323 


119 


SSSEDHKHVPPHSAMFCFF/ CLRQSLTM 
LPRLVLNSWAQVILPPSPPKPFLDALKK 
KSSKWMSSLLIT 


6940 


20841 


A 


6995 


230 


1 


FFFETESCSVTQAGVQWCDLGSLQPLPP 
GFKQFS CLS / LPNS WDYRHALP CPANFC 
IFSRDGVSPGWSGWSRIPDLG 


6941 


20842 


A 


6996 


346 


121 


RDICTPMFVTALLIAKIRSPHKCPSVDT 
WIRKMWF\ IYTME YYKAI I KtSFKILGMSH 
HAS L P VFNHM C KL KYL I S 


6942 


20843 


A 


6997 


2 


198 


SNNTWAFREPGITGMCHHAQLIVLLYM 
QFHHVDQDGLDVR/NLVICPPWPPKKLG 
IQALNQPSPDI 


6943 


20844 


A 


6998 


183 


31 


MRHL ICS S Q I Q QGT VGHACNP S T LGDHGE 
RITQGQEFKT\GWIVRPCLYKD 


6944 


20845 


A 


6999 


3 


471 


LALSCSGTILAHCNIiHLLGPSLPPTSAP 
RAAGTTGVHHHAWL I FVFFGKDGVAQPG 
LKLSFK/VIRPPQLPKVLGLQA 


6945 


20846 


A 


7000 


483 


239 


GN1SINILFLFLFFIETGSHFVT\RLECQW 
VQSSAHSQPSAPRAQVLIFVFLLQTGFH 
YVG\QAGLQLLTSSDPPHSALLKVRD 


6946 


20847 


A 


7001 


132 


353 


YAKLGTRGFARGP WPGGFLTVAVWMS F 
AQAGINPFVCIFSNRELRRCFSTTL/PL 
LQKIQVTKGTLLCYMREHL 


6947 


20848 


A 


7002 


2 


399 


EDAQEVELQEGKVHSESDKAITPHSQEE 
FQKQERESAE\SELTES 


6948 


20849 


A 


7003 


2 


135 


I FQFIi VEMGFHHVGQAGLELL /NLMIHP 
PRPS KVLGLQAYWHS TRP 


6949 


20850 


A 


7004 


2 


114 


I FQFLVEMGFHHVGQAGLELL/NLMIHP 
PRPSKVLGLQA 


6950 


20851 


A 


7005 


455 


1 


ALFPAWLWLPS S S KGGLKLSHGTTLTLF 
LLPSS TLKDLWPQE QWL/GTCNS QHFGR 
P RKADRXAPGVQNR PGQRGE TPS RKKGG 
EGNWGGGGELHIYTYTHTHTYVHTYIHA 
NWTWWCVPWLAFQETEVGGPLEPEVAP 
AWVTVRPCLKKK 


6951 


20852 


A 


7006 


371 


172 


QVILCLSLPGSWDHRHTPPHPA/NFLVE 
TRFQHVDLE LIAS ND P P ASAS Q SVG I TG 
VSHCARPCYDL 


6952 


20853 


A 


7007 


486 


252 


PQTVFFFFFFFFFFFQAIiFSTFYILHVF 
FFFFPPFFPKFQFFPFP/LSLIKQGKFF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue x>f 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H^Histidine, I=Isoleucine, 
K— Lysine, L=Leucine, MHYIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptopban, Y=Tyrosine, 
X— Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFCYLCNSLFIWTQYFFVNE 


6953 


20854 


A 


7008 


1 


113 


KNFCR/DRSLDTLPRLISNSWPQVTLLP 
QPPKVLGLQA 


6954 


20855 


A 


7009 


105 


383 


SLHRPAHPGLLKFGKKAIHS F IHLCRDK 
GDLAILPRLVSNSWPQVILIiPWST/NVL 
RLQA 


6955 


20856 


A 


7010 


609 


348 


ESHSVTQAGVQWYDLDSLQPLPPGS\SD 
S PAS AS HVAG I TGMRHHTQL \ I FVLLVE 
TRSHHVSQTKFVLFDTTKFWIFYSGKR 
TRIRL 


6956 


20857 


A 


7011 


382 


233 


DLEI FRGKTFLHHI FPFFFFFFC/RRRG 
LAMLPKLVLNSWPQVILPKCWD 


6957 


20858 


A 


7012 


359 


199 


DLLPSLPPFLPSFL\FFLFETESHSSPR 
LECSGAIPAHCSLGDRARHRLKINK 


6958 


20859 


A 


7014 


3 


274 


CFFFFFETGSHSVTQAGVQWRHLDSLQP 
PPPGCKRFSCLS/LPSSWDYRHV/PSSP 
LLVRNSSICLLSCFEMTPELFDGVLAIY 
CCVTNDLKA 


6959 


20860 


A 


7015 


372 


54 


LPSAWEAEMRGPLEPKGLRSQRAEIAP \ 
CT PAWVTEQAYVSNKTNTERP FQKI EHV 
EGLSCTTQKFSSND 


6960 


20861 


A 


7016 


548 


158 


KFDT / GATLFDGRPAVFDEA/ DFPS VAV 
YLTGAEYTGEELDSDTWQAELHIEVFLP 
AQVPDSELDAWMESRIYPVMSDIPALSD 
L ITS MVASGYDYRRDDDAGLWS SADLT Y 
VITYEM 


6961 


20862 


A 


7018 


396 


34 


QKNE I KPTMQLHLTPARMAI I KKILKNG 
GWHGCGE / METLLHCWQE CKLVQPLWKT 
MWRFIi KE P KVEIi P FD PA I P VLG I QQ S QH 
WESSNPSTGDPAIPPLGIQQSQPRRKVI 
KKRYLHTRL 


6962 


20863 


A 


7020 


2 


241 


FPGPTAARRRQKEDRYEEYDKVLSDKLK 
EAETRAEFAERSLTRLEKS IDDLQDEL \ 
YAHKL KYQA I S E E LDHALNDMT S I 


6963 


20864 


A 


7021 


347 


3 


NDL KS L THS VAQAR VQ WQ \ N I S S LQR P P 
PQFKRFS CFSLPS S WDYS LLNCKFYHLH 
FADEEIETLiTSQRAQLECKSNMYKDIHN 
TVRSYKLKYKVPQKLERSRNRTKSSTRE 
VRS 


6964 


20865 


A 


7022 


2 


160 


LREDDRRGRGFHHFGQAGLELLTSSD / L 
PTSASQSAGITGVTHRARPDWSFCQF 


6965 


20866 


A 


7023 


3 


248 


DRVSSVAQSSLELLSSSNPHLGFPKCWD 
YRPNPPLPFSSRSLLFLPSFLIFCRDRV 
SMLPKLASSFW\VRPPWPSKVLGLQA 


6966 


20867 


A 


7024 


483 


143 


PLNPLKFFFFPKAFKFWGGVGP I CSPPK 
RRVLSQKSQVGFYFPPQKEKGYTFPSPG 
KFGPPKEILKRAPPFFFFFFFFLRRDK/ 
SLTMLPGLVLNPWPQAVLPLQPPKALGL 
QA 


6967 


20868 


A 


7025 


414 


81 


PHPLKFFFFPETFYFWWGVFSPFSPPGR 
KVPPQKSPGGFLLAPPKGKSFPFPCPFK 
IGPPRGIFKRAPPYFFFFFFFNRRDK/S 
LTMLPGLVLNPWPQAVLPLQPPKALGLQ 
A 


6968 


20869 


A 


7026 


16 


218 


KFAPRCRNSARENTQNQPRKHSDTPFLK 
K/HKIN1SJNNKLAGRGTHLWSQLLGRLRW 
SDHLSPRQSRLQ 


6969 


20870 


A 


7027 


28 


272 


EFVGVGPSEPGAGYWLL\CAFLSPSEKC 
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amino acid 
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Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
acid 

residue ot 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R— Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SIRVGVTRFSRCHLSPLTLTKKGNSLTP 
CASRVRQCLTLLRLAHGALHPLSSTP 


6970 


20871 


A 


7028 


277 


480 


LGIFHMLSDSFLLLLLFCLFVFKIFFGI 
FCRD \ RS CYE VWAMLPRL VSNS WAQ VI L 
PPWPLKVLGLQA 


6971 


20872 


A 


7029 


24 


153 


S VW WNS P/RSSGE FRKTGVHDADFE AH I 
IDMLAKASKIEVYRGN 


6972 


20873 


A 


7030 


2 


470 


I E T P R P LW WHS AYRGRQAS L S CGGLH P V 
R / AS WLLCL PNQ AWAMAGAP P P ASLLP C 
SLISDCCASNQRDSVGVGPSEPGVGYNL 
LVRRFLSQSEKRNIRVGVTRFSRCV/LL 
PLSLTRKGNSLTPWASQVWQCLALLRFA 
HGARTHWPAPTVWHSLVR 


6973 


20874 


A 


7031 


487 


107 


S P PAPGVAPAP P P P TRVFS KKKKKFFF F 
FLKKI FFFKPGGDPGGIFFFPGGPPPGP 
KKFSPPLFPKKGGYRVFPPPPGKISIFF 
FFFFFFFFFF/ CRDR/SLAMLPRLVLNF 
WAQVI LLP WS P KVLGLQA 


6974 


20875 


A 


7032 


576 


36 


GYTSQMGRPGRGAPHFPDRAAGQRRSSL 
PRWGGWAEALLTSQMGWRPGRGAPHIPD 
DGQPGRGAPHLPDEEQLGRGAPHLPDEE 
RPGRGAPHFPDGAAGQRCASHPTRGGRA 
EVLPTSQTGQPGRGAPHLPDDGRPGRGT 
PHLPDG\GPGRGAPHIPDGAARQRHLSP 
PRRGNCTTRIVDQ 


6975 


20876 


A 


7033 


2 


129 


YGPTHASGAMLRSCAARLRTLGALCLPP 
VGRRLP \ EASRDPS 


6976 


20877 


A 


7034 


1 


287 


RLSLALVARDGVQWCDLGSPQPPPTGFK 
RFS CLS L P S S WD YRHVS P R PANF VFL VD 
TGFLHFGQAGLELPTSGDPPALASQSAG 
I\TGVSHRAWPK 


6977 


20878 


A 


7035 


3 


315 


HAS AHAS ALF YF I FFETRS RSVAQAEVQ 
WRKLGSLQPPRFKQFSCLSHPYRHAPPH 
FANL VFL VETG FA\ HVGQAGL E L P TS SN 
LASQSGGITGVSQRARPGLNF 


6978 


20879 


A 


7036 


430 


304 


VI FYL I FFCKDGV/ LTMLPRLVLNS WLH 
AILPPQPPKVLGLQA 


6979 


20880 


A 


7037 


3 


256 


FFF FNRQKDL FGNFQ TCQKWHS PS PR P T 
/ APHQAP KGNT S P P S L P IHLT LAS T CT Y 
P I PQAMLENWSVP S PAQAAGEAAL WHPR 
R 


6980 


20881 


A 


7038 


13 


481 


FARLHE FGTSR VI YLLARLVSNS WPQ VI 
CLPWPPKL/ LGLQA 


6981 


20882 


A 


7039 


455 


255 


S CLSLPS S WDYRRAPRHPANFR \ FLVET 
GFRHVALAGLKYPPALASQNAGI TGVSH 
HARPEQFILTN 


6982 


20883 


A 


7040 


490 


172 


KCWQG/C/GETGTLVHC/WIGRNCTMMR 
PQWQQAWQVFKKLHRELPSDPAMPLLGV 
QPGGLKIDTQT/RYMYMALLTVKRWKQP 
EGP P VDK/ LDKL W Y I HTTE YRVT I KKK 




zUoo4 


7\ 




4.38 


37 


T I?DWT T) D ITT UVPDritPC nDTTVT TlirnT T TV 

MAKRQSQGKVFETPKEK/ SRAGPPPKFT 
TGFSAE KNGGPKPGGYI SQRVQKKKNNC 
HS R I LHS AKLS F KNR S E I KT FLNKQKL K 
/DFTASRPALQKILQEAEFHRTD 


6984 


20885 


A 


7042 


292 


463 


I E A I S FL P F FFGD S LTGAQWRDLG S LQ P 
LPPGLKRFSCLSLPSSELP/CSGRCQS 


6985 


20886 


A 


7043 


59 


257 


LASLVSQSWMKKMWYIYTMKYYAATK\G 
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SEQ IB 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K= Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovyn, *=Stop codon, /=possibIe 
nucleotide deletion, \=possib!e 
nucleotide insertion 














NEIMSSAGTWMKLEAIILSKVTQEQKTK 
HHMLSLRSGS 


6986 


20887 


A 


7044 


279 


14 


RTADTE I GRGS S L C WRYGS LQ I QVS KS H 
NFFFETGSHSATQ YGVQWHNLGS PQPQT 
YGLQQSSHCGLPKCWDYRE/LPPSMECS 
GTISAHLSLKLMGS SNPLTVAS QSAGI T 
GNSSSRLHDINYGLQ 


6987 


20888 


A 


7045 


578 


253 


AAASTFSCFFVFFETEPNSVA\RLECSG 
ATSAHHKLCLPGS SNSPVS APQVAGI TD 
AHNHIQLIFVLVETGFHHVGQAGLELQ/ 
NLVISLPWPPKVLGLQAWSHHAQPFV 


6988 


20889 


A 


7046 


3 


229 


FFKTGSLSVSQAGVQW/PNLSSLQLPPP 
RFKQFSCLSFPSSWDYRHPPPHPANFCS 
FSRDRVPPCWSGWSQTPDLR 


6989 


20890 


A 


7047 


367 


151 


LSLLSSWHYGHVPTSPANFFYF/ CVDTG 
PRCVSNSWTQAVCLPQLPRVLGLQA 


6990 


20891 


A 


7048 


370 


141 


CQPPHPANFCIFVEIVFCHVGQTGFKLL 
TSNDLPAWASQSAGIWD\TGVSHHAWPS 
RFNSEVQFDYVSNKNVDFSLS 


6991 


20892 


A 


7049 


104 


361 


VTAP GLE AAFKE R VW PGAVART CGP STL 
GGRGRRl\RGQEIKTILANTLKPCLY 


6992 


20893 


A 


7050 


2 


325 


RFSCLSLPSSWDYRHEPPRLAIFFFFFF 
GI I RKGGF TH \ VGRGGFKP WT S GNM PAL 
PSQGVGFPGLFS PPRPVRGFFYPLPKWG 
GL F LQWG VKFKKGG S GDLF KM E KK 


6993 


20894 


A 


7051 


14 


208 


AHE ILSEI ENPLEPGWDHRCVS S CPAHF 
FVF / CYRDGGL PMLPKRVLNS WAQAI LP 
WAPKVLGLQV 


6994 


20895 


A 


7052 


103 


368 


GNRVHT YMLTALFMTAKKWNKP KC/ PVS 
DEQIRQ I WS VHAMEYYSAI KRNEALI CV 
TTWV3STLENIMLSESSQKTMGCMLRRS I Y 
MKGPE 


6995 


20896 


A 


7053 


421 


2 


LFLP PE VARGFP KGRGRAS P S PGGFFFG 
PRGGPIFPPP\SFFPTRPWSKGEVFPSP 
TPGKGFRGFFFPPPPKGKGGVFPLFFKK 
YLGGEGAPQNKGGPVFLLWPPHSGPPGP 
PKNFFFFFFFFETESCYVYPGCSAWQS 
Q 


6996 


20897 


A 


7054 


489 


40 


I PEAPGGPGFKREDS PGANKSGPKAL YP 
P KTFHP PWEGRFRS FQKGAP P P P VFFP P 
FFPLPFFFLRQSHS I PQAGVQWRDLGSL 
QPPPPPGFKQFSCVSLPCPP\RQMGFRH 
VGQ AGLE LLTSGH P P D S AS Q S TG I TG VS 
HCAEFHRTD 


6997 


20898 


A 


7055 


3 


261 


TGSGSVTQAEVQAEVMAHCSLNLLGSSD 
L S ALAS RAAGAAGTCHHAWL I FVFFIEM 
GFHYVPRL/GLELWARVIHPPQPPRVLG 
LQV 


6998 


20899 


A 


7056 


1198 


1040 


FLWK IALLF YFKKLPQS LQPL VATTLS S 
QQPSTSRQDPLPAKRL\RLAEGSDDR 


6999 


r\ f*\ f\ r\ 

20900 


A 


7057 


271 


33 


L F FF F KME S RS VARLE C SG A I LAQ CISfLA 
RLFLVE\QGFHHVGQDGLGLLIHPPWPP 
E VLGLQAQATPP S PKI TYEAA.TVM 


7000 


20901 


A 


7058 


2 


194 


CYRRQPPHLAMNFYFLLNFFCLFFC\KA 
LSLLARLLLNSQLQASLPPQPPKVLGLQ 
ACGTTPCP 


7001 


20902 


A 


7059 


3 


272 


NMWKLNNILLNNQWVNGEIKMKI FKNCK 
E \NGNTT YQNLRDAAEAFLKGKF IAVNT 
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SEQ ID 
JVO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F~Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q-Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YIKKDYRKTCLRLFYSYHFFNKEKEYTL 
KTMIKV 


7002 


20903 


A 


7060 


397 


129 


FWGPHKKNFPLPARGRKLGSFKTA/LPL 
FFFFFSFSLSLFFFSGS CSVS \RLKCSG 
TITAYCSLEFLGSSNPPASFSQAAGTTF 
TFFLFIG 


7003 


20904 


A 


7061 


3 


351 


S C FRLLCLPKEAWAMAGAP P PASLP PCS 
LiISDCCASNQCDS/VGVGPSEPGAGYNL 
WRR F L S GS E KRN I RVGVTR F S RCRP S P 
LSLTRKGNSLTPCSSQVRQCLALLRLAH 
GARTH 


7004 


20905 


A 


7062 


60 


324 


DDFVS WQTGVHWCGRGSLKPQ P PRLS S 
TSRLSSPSSWDYGRVPF\NIFCRDRVSL 
CFPGWPRTPDLRGSSCPSPKWKLHTRV 
TAPGQ 


7005 


20906 


A 


7063 


481 


160 


RPGAPG/ PDDFARMGPRPKGGAFTLVHP 
MPSGAAFPGLLSE P \ PHRGPRS FAARS P 
TLRYPGRAPS KGSGEDPARP PATGPGPN 
LIQDGLVLMSLTEMVC 


7006 


20907 


A 


7064 


65 


481 


CLCPAPRGGAYRGRQASLS CGGLHP VRA 
S RKL CLi P KQAWAMAGAP P PAS L P P C SL I 
SDCCASNQRDSVGVGPSEPGVGYSLWR 
RFLSRSEKRNIRVGVTRFSRCV/LSPLS 
LTRKGNCLTP CAS QVRQCIALLRLiAHGA 


7007 


20908 


A 


7065 


392 


183 


GGLKP/ IPGNPGNSLFSQKKKKQPGAGG 
S PRGS P P PGGLGGG I FFGPKGLGS INPN 
FGPAP P PGGQKKKS LF 


7008 


20909 


A 


7066 


538 


248 


FFLKQS CS VAQAGVHWGYLS SLQTLHPR 
F\KFSCHSLSSSWDYRCAPPRLAN\FVL 
LAEMGFHFHHIAQAGLELPTSNDLPTSA 
S QNAG I TGVSHHT 


7009 


20910 


A 


7067 


2 


127 


VTGQAWHACS PGTLGARAG / W I VWAQQ 
FRTSLGIMVRPCLYL 


7010 


20911 


A 


7068 


379 


217 


KKMFCKNE/SFTIFPRLVLNSWAQTILP 
P WP P KVLGLQ VQTT VPGS YS PL S IjYKE 


7011 


20912 


A 


7069 


386 


176 


KRD KVDKWliKKM W C I HIME Y YLA I KKE T 
LPFATWVWLQ\TFILSEISQEQDKYHMI 
SLICGIKMLIYRIRE 


7012 


20913 


A 


7070 r 


30 


408 


NSSELGSVGVGPTEPGTGGNLLVCWLLR 
LWEKCSVWAGVSRFFQ/ CTDYDGFPWVG 
KGNP P TP CASQ VRRHPALLRRTLHGLH P 
CASHEKNQVPQLEMQKSSVFCIDLTGSC 
RVELFPFGHLGSKNP 


7013 


20914 


A 


7071 


310 


414 


HVP WR S FAP AHCNLGS LQ P P P PGFKRF S 
CLSLPSSWDYRCPPPRLAN\FCSGSLQS 
WLTAT S AS W VQAI L L S QP P E 


7014 


20915 


A 


7072 


353 


243 


IFIFCRV/RGLTILSRLVWWFWDQVILL 
PEPP KVLGLQV 


7015 


20916 


A 


7073 


1 


276 


ELWDNFKEPNIYLKSLKERKECVCERGR 
RDRNKTLEDIMAKNFSNWMKI INPQIQE 
AQQTPN\QSTGRKIaALRHI TIKLLKTND 
KKKILKAV 


7016 


20917 


A 


7074 


362 


133 


GGGGGGGRYS PQKKGTLPLTP PSNWGP K 
GAPPPRGNFFFFFFFS \RDGCLAMLPRL 
VLDSWAQVMIiLPQPLKVLKL 


7017 


20918 


A 


7075 


3 


177 


KWS PHHS L PNN \ WL YRHM P P C P ANVE F F 
CSIAVIiPRMVSNSWPGAILSPWPTRMLG 
LQT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D— Aspartic Acid, 
E=G!utamic Acid, F— Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, JL^Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7018 


20919 


A 


7076 


1 


169 


PQLPS SRNYRHVP P CWANFC I FS /RRRC 
FTMLARLFLNSWPQVTRPHWPPKVLGLQ 
A 


7019 


20920 


A 


7077 


402 


231 


PSCLSLRSSWDYRCMPPHLTNYYYYYYY 
YFVEMGSHS VTRPQDILPPWP \ PKVLGL 
QV 


7020 


20921 


A 


7078 


422 


316 


FCKDK/SLPMLPKLILNSWVQTTHLPQP 
PKVLGLQA 


7021 


20922 


A 


7079 


58 


293 


DP I CTKS KKLKKTQ \TDKEDE I TTKKAK 
KAKSKIKIEEDGEEGKWWEEEVSVKK 
KKDKKKPWLGMVAHTCNPNTLGG 


7022 


20923 


A 


7080 


363 


123 


GVF FRDHARAFSLHTGVC AVH ISRCF/H 
VLTHFKE F LFHW FLT I HTHTHTHTHTHT 
HTPSFPELGKAVPMHRIVIVPTNYQH 


7023 


20924 


A 


7081 


3 


181 


YFLL\FFWDRVLPCHPGWSKGWSQTPGL 
GQSAHLSLPKHWVCRREPPHPALFLLFK 
NTIS 


7024 


20925 


A 


7082 


453 


142 


FFXXFXFXXXXXFFFFFFFFFXPPFFFF 
FFFLKNSFPYFFLFFFFYPLFLPLFFFF 
FPPNFFFFFLFS *NFFFFFFPPPPLFFF 
FFFFLFFFFFFFFFFFFFSF 


7025 


20926 


A 


7083 


268 


92 


KNNLP S S WG \ YKCS PPHKGNRFFFYI KD 
GGGLIMLLRLISNSWAQVILLSWPSKVL 
ERQA 


7026 


20927 


A 


7084 


413 


0 


SCSVAQDGVQWDDLGSLQRLPPKNKPFF 
CLSSTPAA\FPSDWDYR 


7027 


20928 


A 


7085 


400 


125 


KTRSFSWRGGVQIj/CI/GSLKPRTPGV 
KS S S PSAS P VGRNTGMC PP PQQ I FFFL V 
ETGSHYAAQASLKQSSCLGFLILFLSQP 
LGEKEGPTICT 


7028 


20929 


A 


7086 


377 


38 


NPGPGFYFWGPKKKIKPFPPPGVKLVSF 
KRAPPFFF/CFFFETGSPFVAQTSLELL 
DSHNPPTSASQSAGITDMSHHARPMHS I 
CM I QL WKS YHR FHF 1SQGSRD CS S SREE 
T 


7029 


20930 


A 


7087 


1 


151 


S FCRD / KRIiIMLPRLiVLNS QAQVMLLPQ 
PPKVLSHHTWPI IKFSFMHFNLF 


7030 


20931 


A 


7088 


405 


111 


PRLVLN\SELRRSAHLSLPKCWDYGPEP 
PCPAQMHVLKSGTAAHKVHDFLASWSHL 
PTHS P FP VLCS KTQ I SVAFFI QNLRAGQ 
RSGQRHFGKQCGS IH 


7031 


20932 


A 


7089 


1 


309 


KQ INKS S LR I SGKP PNCEC VLHLGKS VN 
KSLLEPHKLASDP/HLCDFLESS\EQVK 
S VKELNCHV \TS LYTLGALE S DMAEYF \ 
, JSTKYTLGPGMLAHAYNSS ILGG 


7032 


20933 


A 


7090 


414 


240 


FSLF/ CKNLKKGLTMLPKLVSNFWPQAI 
LLP Q P P KVLGL 


7033 


20934 


A 


7091 


460 


328 


RHPPPYLGK/FFCRDR/ SLTMLPRLVSN 
SWPQVILSPWPPKVLVL 


7034 


20935 


A 


7092 


388 


199 


KQFSCLSLPSRWDY/SALPPRPAHFCIF 
NRDGFS PYWSGWSLGLPKCSD YRRRIFL 
RAKAHVRE 


7035 


20936 


A 


7093 


18 


271 


MHSFATYLLVKLHLLVLCRLCPLSMYSG 
EKSYILLTKRPMLWF/SSGPFPSWSFSS 
I TLRI FLTCLTLKL S GCSCCCCCYCCCC 
C 


7036 


20937 


A 


7094 


415 


291 


GDTjGS PQAPPPG \ S RHS PAS AS R I DGT T 
GARHIMNFKNLAIK 
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SEQ IB 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaJanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KNLysine, LHLeucine, M=Methionine, 
N— Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vaiine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7037 


20938 


A 


7095 


21 


428 


DKGLT VTQ ARVQW / HDP S S RQ P P S PGLK 
RFFRLSLPSNWDHRCAPPRVANYYFFYR 
WGS YHVAQAGLE LLAS SNT P T SASH SAG 
ITALFLRDTLVRFFMGSFCFPQSFGAAF 
TAQ I PQRGRTEEL F I FLFLND YLFS 


7038 


20939 


A 


7096 


320 


134 


LPL\PKTTFYLFLNRDRVLLCCPGWSR\ 
ILSSKQSTLLGLPKCWDHRHEPPQQAVD 
HTLDFS 


7039 


20940 


A 


7097 


402 


221 


DYRHVP PFPANF / LVEMGFWHVS QAGLE 
LLGSSDPPNTSQSAGITETSHQAQAYTF 
IFIIY 


7040 


20941 


A 


7098 


427 


259 


PPPRFKRFSGLSLLRRWDYRPPPSCPGN 
FFFFF\CI F VQMG FHLVGQAKMLi KLHD 


7041 


20942 


A 


7099 


422 


275 


VQWYNLDSLQPASSGFKRFSHLRLLSTW 
DYRH\PRPANFCIFSNTKDTFI 


7042 


20943 


A 


7100 


3 


404 


HASGATWRNPVSTRVWWHIPIVPATQEG 
EMGGSLEPRK/LKA&VSHCTPVWVT 


7043 


20944 


A 


7101 


539 


336 


FETESRSFTEAGVQWRDLGSLQPPPPGF 
KR / D S P ALAS RVTGT TGAHHHAGL I F V F 
LVEAGVS PHWPGWS 


7044 


20945 


A 


7102 


408 


287 


WLILFFI FCRD / RGLTMLP TLVSNS WAQ 
AILLPQPPKVLGL 


7045 


20946 


A 


7103 


1074 


705 


SFLRWSFTLVAQAGVRWRGLGSLQPLPP 
WFKQFSCLSLLSSWD\YGRPPPRLANYF 
F\VILVETGFHRYWPRMVSNSQPQ/CDL 
PAVGRPNSAGI \ TGVSHCAWPKMFLNNC 
TCTHSPYL.QGIVSLG 


7046 


20947 


A 


7104 


227 


55 


FSQHKINIQFYI SLSPTHI TMS FFLCRD 
/RGLAMLHRL/YLNSWPHVILPPWPPKV 
DRL 


7047 


20948 


A 


7105 


479 


314 


PRLLLXFFFIERGIALCSPGWSQTFGLQ 
ESSCFGLPKSWGPRKEPQPCLALKWTLQ 


7048 


20949 


A 


7106 


37 


143 


NSFF/ CKRRGLTMLSRLVSNSWPQVILT 
PQPPKLLEL 


7049 


20950 


A 


7107 


441 


106 


GAPS PAS LPP CSL I SDCCASNQRDS VGV 
GPS E PGVG YSL WRRFLSRS E KRNIRVG 
VTRFSRCV/LSPLSLTRKGNSLTPCASQ 
VRQ CLAL LRIAHG AHTHWP AP TVWRS L V 
R 


7050 


20951 


A 


7108 


413 


207 


CCIQAGPAFFGGPPPPRWGWGGVPPPGY 
/ MPS P PVFRVGLL VFRGPPPNWFL PMGL 
QPDGPGLGDPKKFAK 


7051 


20952 


A 


7109 


92 


409 


PHLPDGAAGQRCPPPGRGGGCVRAGPQL 
P PGRGGWPGGGLP P P P SRTGWLPGGGAP 
HFS DGAVARQRVS S LLRRGGRAE TLLT S 
QTGSRPGRGTPHISDDGRPGRD 


7052 


20953 


A 


7110 


408 


13 


PPWGPPGGGPPEARGSNPPGPPWGTPLS 
PKKPKETYSGQGGGPLNPPPPEG/SGPGI 
PFSPEGKGSLNPGGQKGKIGAPSPPPGG 
PKKKPPSQKKKKGKEKKEKVIKGESSQI 
LLFKVSKI LSVNFLHLSSL I 


7053 


20954 


A 


7111 


290 


175 


L.KKNCR/DRSHTMLPRLVLNSWPQTILP 
SQPPKVLGLQI 


7054 


20955 


A 


7112 


1136 


552 


I LiFKEMGCEAGQRCS CPVI P I TLGRPRR 
ADHLRCGVRDQPGPTOLKPRLYSNTKtS 
WAWWRVPA\ I PA\ I RRPE ATRFT FNPRR 
WRLPLXsTLRSHPLHSHPGRQS KI P I KKER 
KKGQDRKGKGCGQNWGQKQKRRKEGKRE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-Isoieucine, 
KHLysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /—possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














MGMGRI GGKNKKEGRKERKEGRKEGNKW 
VWAELGAFPEQGKGLRRGPSEEAVGVSP 
L 


7055 


20956 


A 


7113 


570 


199 


L CLL KQAWLMAGAP P PAS L P P CGL I S DC 
CASNQQDS VGVGP S E PGWYSLWRRFL 
SRSDKRNIRVGVTRFSRCV/LSPLSLTR 
KENS LTP CASQVRQ CLALLRLVHGAHTH 
WPAPTVWHSLMR 


7056 


20957 • 


A 


7114 


466 


284 


RTRGIGWTANRRMKATQPGLTFFHLIHK 
QEGGNVGSQKGQRD\VTNSAFHWRLHDQ 
TANCLS 


7057 


20958 


A 


7115 


474 


224 


DSNFEKS S TLGKMLSNS I KCYRE I FCER 
KSQLMQQTSML\FKKLPQSPQPSATTTL 
ISQLPPTSRQNPPPAK\RLQLAEGSDGC 


7058 


20959 


A 


7116 


377 v 


111 


FXXNFXSFLFFSLSLTRKNWVAHITFFF 
FFFLPTPPFFYFIFLLIFCFFFFLFYIF 
FFFFFFSFFFFFFFFFFFFFFFFFFFFF 
FFFFL 


7059 


20960 


A 


7117 


417 


148 


VPPCLANF\NFFVETGSPYVAQAGLELL 
GS SHP P TLAS QS S Q I TR VS HCSLLYFLM 
HSNTS I S FFS FWALVS FLEMPS S CQDGI 
TVLAR 


7060 


20961 


A 


7118 


2 


196 


R VP P P C PA \ NF VF F VQTAF CHVAQ AG P K 
LLiASSNPPASASQSARIAVLSYCPLPSL 
RLRSHLDFF 


7061 


20962 


A 


7119 


2 


417 


EAGGS LE PRNLRLHCA/ LVAPMNSRC I P 
AWAT 


7062 


20963 


A 


7120 


801 


450 


GPKFLGPPGVRIN/RRPGGGGLPRVYKK 
KNYPEDRPIVLPAFFSGGGR/GPGPHDK 
HPLPRKKQKNSPGRGGEPPFFPPPGFPP 
GGERKNPLTPQRKRVHWGQGLAPPFPPS 
LSAKPKLF 


7063 


20964 


A 


7121 


3 


199 


DAWG FAL WQ AGVQ C CDLS S LQ P S P / RL 
ANFVFLAETGFLHVGQAGLKLLTSGNLP 
ALTSQRLRLQ 


7064 


20965 


A 


7122 


407 


164 


NVAGL TGAYLD S FFGRDRV/ LTMLLRLV 
LNSWAQAILPPWPPKVMGLQACTIEVSS 
LVAQ IAS KRLSGVQESDEAAW IRDC 


7065 


20966 


A 


7123 


33 


192 


RWDFTMLPRLVLGSKRSAHLGLPKCWDY 
KHEP / P CCPGWS WAQRDLPTLAS QS AG I 
TSMSHLHQAIPLKFLEQFIRFSILVF 


7066 


20967 


A 


7124 


3 


399 


S I PVDWPSRAPTPRGRESAAHQGW/ YPV 
GQRN\RMAGLSPRS FCWREVSFSR / SHS 
C/ICWVQLGKSAVLPQQSRGPGAHEGSR 
RRGGSSPPCPPLQTQLVFSPQELSMSAP 
I HRLS VTLQGDC I S TGGAPFRCRLA 


7067 


20968 


A 


7125 


1901 


1574 


FFEMES CS VAQAG VQ WHDRGS LQ PL PPG 
FKQFSCLSLPSSWDYRHLPPHRANFFVF 
LVETGLHHVGQ\SGLELLTLGD/LLPAS 
AS QSAG I TGVSHHAWAKNPCFLTS I DT 


/Uo8 


20969 


A 


7126 


408 


234 


LPS SWDYRS P PPQADTF YI FSRD/ MGFTM 
LARLLT S S D P PALAS Q S AR I AGVS HHAQ 
PVYF 


7069 


20970 


A 


7127 


88 


277 


KLSTEYKKQRPEDS FFLFL/ VE IGFRYV 
AQAGLE LLS S S D P P TLAS QS AG I TG VS H 
HIQPPFKK 


7070 


20971 


A 


7128 


3 


140 


QRWWCVPATRKAEAGESLETES SRLQGA 
M I T P \ C S P AWATE QDIaAS 
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SEQ n> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T—Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


7071 


20972 


A 


7129 


1 


212 


PLLFVSPPLLSVSPPLLPFP / EPAASPP 
PSTHSLLPSVQSESLGQFLHPPDPPPLA 
FPAPLKHQDLHISHL 


7072 


20973 


A 


7130 


361 


3 


RNNI S QDIAVI GNDS SNRSGQS KLK I FW 
KGFT ILHAI KDFCDS / WEE VKI SRLPGV 
WKKLIPTLMDDF/EGFKTSVEEVTADMV 
ETETELERLELEVKPEYGTGLLQSRDKT 
LTDEKLLLMD 


7073 


20974 


A 


7131 


421 


201 


NFCILVD/TGFHHVGQVSLELLTPSNPP 
ASASQSSGITGLSDHSQRKSELVLSFNF 
FALFHILLCVMFLAIDRND 


7074 


20975 


A 


7132 


221 


529 


LIKFTIHSFYFIYLFIFFSGDGCFALSP 
RLESSGVISAHCNIRLPGSSDSPDSASS 

VAGI I / cmrhhtqlily/clvet/efhh 

VGQGGVDLLTSGDPPASASQSV 


7075 


20976 


A 


7133 


1 


227 


KGIHFSSANDVKKKTVLTWLNSQDPQLF 
RDGLDGWYHRLQKCL\ELDEVYVEK 


7076 


20977 


A 


7134 


1242 


300 


DFVLPDGRVIP\KGIVCLINIIG\THYN 
PN\LWPTPLEVYDPFRFNP\ENIKERSP 
LAF I PFS AGPRNC I GQAFAMAEMKWIiA 
LTLLHFRILPTHTEPRRKPELILRAEGG 
LWLR VE PLAGTS RGNPRAS SRPSLS DPW 
VLRRPLPPPIFLFLLLLTLPLLFSVMAR 
EPPSPLRVEAPGPPEMRTPPAIEATPKG 
TLQPAGGRLRFLSGCVPILHQVAGHRYG 
KDKMG I LQHPDGTVLKQLQPP PRGPREL 
E F YNMVYAAD CTDGVLLELRQ YLPKYYG 
I W P P P AAPNDL YL KLENVTHKFNKP C 1 M 
DVKIGQKKL 


7077 


20978 


A 


7135 


1 


259 


GTSCILASMLTRHRLRFSF/CLFETEFC 
SVAQSGVQWYDHS S LQPQPPGLKLFS P V 
NFPSSWDYRHMPPCLASASILKLLFTID 
LGFYS 


7078 


20979 


A 


7136 


1132 


900 


CSGVISAHCKPRLPGSRHSPASASRVAG 
TTGTRHHARLNFC I FFLVEMGFHRVS QD 
GLHLL/NLWCPPRPPKVLELHA 


7079 


20980 


A 


7137 


1 


501 


FFLRWSLTVTRLECSDVI SAHCNLHLPG 
SSD S PASAFREAGT T/ GHDAQL I F VFL V 
ETG/ FHHWKDGVDLLTS 


7080 


20981 


A 


7138 


2 


229 


WHS HGL L WS Q P PL \ YLS L S S S WD YS H VS 
PCPAGLCLFLVETGSPYVAQAGLSLRGS 
ASQS TGI IGMSQPAGFWKQL 


7081 


20982 


A 


7139 


255 


58 


IFGKGFPPFFCQNQGFGVPPPPRAFFFF 
FF/CPAKYYTMLPRLWSNSWAQAIHPP 
WP L KVLGLQA 


7082 


20983 


A 


7140 


422 


274 


TMLPRLVSNS \GLKQSSRLSLPKWWDYR 
YKPPCLVYTFFINRPPEVPKE 


7083 


20984 


A 


7141 


417 


60 


VWRFLEELETELPFDPAIPLLGIHLKEH 
KLLYHKDTC\WNQPKWPSMVDWI KEMWY 
INTMEYYTAIKKINAIFRDPLTSPWDPF 
bWKiVJf L.ijJr'ljJilWFAoir JjPLiirj 1RSLDE 
PLHIKGER 


7084 


20985 


A 


7142 


391 


70 


S P P PHP PRKGD YRGGP P / RPGKKRE IFF 
FFFFLVDTGFCHVAQAGLKLLAS SN/LP 
TSASQSAGITSVRHYIQPTLLKI IPQFL 
DILLFLKI FHS FLKS FNFNLGS FY 


7085 


20986 


A 


7143 


405 


304 


FCRDR / SLLMLPRLDLNSWAQG I LLP WP 
PKVLGL 
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nucleotide 
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ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G-GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q— Glutamine, R=Arginine, S=Serine 7 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X—Unknown, ^Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7086 


20987 


A 


7144 


3 


159 


HASE KI WYMYTMEHYA\ SQKKNKIMS F V 
ATWMRLE VN I LS KL.MKNEMMS VNGR 


7087 


20988 


A 


7145 


343 


115 


CE FYFLS C I YVF I YFCR/ DKGLSLLLRH 
VSNSWPQVILLPWPPKVSHQAQPEFYFR 
NKQTKIDLSFEETAADLAYYS 


7088 


20989 


A 


7146 


453 


119 


RCWKGCGLWISLRTT/WREWQLVQPFWK 
TVWRCLRKLKVEPPYEPATPLPGIHLKK 
TKAVAQRDMCTPLFIVAFQHHLLKRLFF 
SRCVFLAPLLKIA 


7089 


20990 


A 


7147 


3 


105 


TIEWGEVIS/SSYSSTMANNEGLFSLVA 
RKLSRPL 


7090 


20991 


A 


7148 


295 


23 


FANI FSHSAETLQTRRE WGP I FMIL/ E K 
NFQPRI SHPAKLS F I SEGE I KS FPDKQM 
LRNF I TTRPALQE FLKKE FMS FPGT WMK 
LETIIIS 


7091 


20992 


A 


7149 


1 


161 


RTRGFFYLDGVRSLAMLP /RLVLNSWPQ 
AILLPWLPKVLGSQTIiAMVAAYTVFV 


7092 


20993 


A 


7150 


1 


219 


DYRCTPPHPAHF/ S IFIFCRQR/ SLTMF 
LSLVLWSWTQAILIiPWPPKLLGLQACTA 
MP I PGVELLWVSGCFVKE S 


7093 


20994 


A 


7151 


177 


34 


ELVSFLSFSLSLSFFPSFLPSFIjPFLSF 
LPSFNPSF\LPSFLPSFL 


7094 


20995 


A 


7152 


398 


40 


MNDRVLIi FHP GWNAVAQ S W FTGASN \ FG 
LKQSSHLSLLCSWNYRHTPQCPASLLAY 
VCM YVCM YVCM YVCRD/RGL TML P WL VS 
NSRPQAILPPQPSKIMGPLLGAVAHAYN 
TSPKGSQRR 


7095 


20996 


A 


7153 


418 


179 


RNS CTQMRNEALPNSGYFPKLKYYYDF / 
CQRQDLAILPRLVYNSWVQAILLiPWPPK 
VLGLQHE P PCLAVI QFLKGNLKI V 


7096 


20997 


A 


7154 


767 


537 


QALLCLPGWS\AVAQSWLTAASNSSDPP 
ACLPSSWDYRHVPPRQAEWSFRVSVSS P 
VKRDPGSPWRRMRPCSE 


7097 


20998 


A 


7155 


421 


301 


HAQLFFCRHW/SFTMLPWLVSNSWAQAI 
LLPWPPKVLGLQV 


7098 


20999 


A 


7156 


410 


269 


AGGPQTGSFFFF/CYERGVSLLPRLVSD 
TWSHVILQPQHPICVLGLQA 


7099 


21000 


A 


7157 


3 


237 


MQIKA/TMLYHFTPTNMTVLGFKKNSRG 
WQARGE IGTLVHCWWGCIMMQPL WETVW 
HFLKRLNKLGWVHACSPNYSGG 


7100 


21001 


A 


7158 


435 


211 


L FLVEMG \ FAVLAS LVLTLE \ PRDSPTS 
ASQRARMTGMSHHTQ PLFCLF VS KFHLA 
KKLS L I WQNLRRAG I CRLS 


7101 


21002 


A 


7159 


3 


220 


SSSASQSGGITGVSHHARPSERQ\SCSV 
AQAWQWCNYS SL I TPTPGLKRS SRLS L 
IiSSWDY\GALPHPSVAS 


7102 


21003 


A 


7160 


1 


172 


FRKVLGRTGSWGLCTQVRVEFMDDTSRS 
IMRNVK\SPVREGDVLiTIiLELELETLRL 
R 


7103 


21004 


A 


7161 


1 


180 


DAAPVLKATARPTRPDRWK/ IDIDSLDI 
HGETP S L Q isN TNTFRAW WC I P W P ATRE 
ADMF 


7104 


21005 


A 


7162 


3 


108 


KKKKTGRFKLV\MQYNRRFVNWPTFGK 
KKGPNANS 


7105 


21006 


A 


7163 


409 


110 


GQHGLTQSLPLSAPPRGAVPRWAPRPD/ 
SGLSLSPLLPTAFATTGASSANRFVSIG 
PRDGNFLNIPQQSQSWFL 


7106 


21007 


A 


7164 


35 


371 


YFHHLSLLLKFLLASMMVFLTVVVRIjLE 
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peptide 

sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E—Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possibte 
nucleotide insertion 














LLFFFLGETEPAFVAQPGR\QGPNLGSL 
RPHFPSVKHFPGLTPPRAGTSGLDPPSP 
AIFGFFKKKRGSPCGPGRAWTPGPRGGP 
P 


7107 


21008 


A 


7165 


3 


382 


YTGPAWWSMLVVLATEEAEVGGSLEPRS 
S KLQ \ Q P PLRHC TPAWAT 


7108 


21009 


A 


7166 


1 


305 


TAVTVDP YHSVY I E VNARPGACMNGGGL 
LMGEQNSLKRKQLVQTIjNKRHGESLAMV 
DSLTLGPKLEEA/AS PPAFARVRLRARQ 
AGRRKGSRLCWEHLVS kv 


7109 


21010 


A 


7167 


383 


55 


RDLSMVRMKSMLAI ALCFTALMGMFNS I 
FYGRWAKL PFPPLSYI QGL S HRNMMGD 
DTTDCS F I FLY I FCTMS IR \ QN IQKIVG 
LAPSRAATKQAGGFLGPPPPCGKFS 


7110 


21011 


A 


7168 


465 


166 


S P QGAQ I KGVG FQKLR P WG P \ PG F P RG K 
PFFWFKKKIiPPQGGGPLCSRFSGGLSQK 
N\ SGPWGE I FPWGQNGPPPS P PGGKETF 
FSKKKKKKKAVDYLNS 


7111 


21012 


A 


7169 


2 


323 


IQGFKTLSEEVSADEVEIVRELGLQAET 
EDVTELLHFNSTACCREIFHERKRQQMQ 
QTSLVSHF\RKLLQSPQSSAIATINQSQ 
P S TLRQD PMP ATRLHLTGGLDNH 


7112 


21013 


A 


7170 


3 


509 


FGTIPLVLCADLNSLPDSGWEYLSTGG 
VETNHKDFKELRYNESLTNFSCHGKNGT 
TNGRITHGFKLQSAYESGLMPYTNYTFD 
FKGI I D Y I F YSKPQLNTLG ILGPLDHH W 
LVENNISGCPHPLIPSDHFSLFAQLEL/ 
YTAFP AP S QRHP P S WQE WKHLQRTAL I 
HL 


7113 


21014 


A 


7171 


1121 


320 


SS CFQTSSHACFVTEFVPGGDPMMQIHE 
DVFPEPQARFYVACWLGLQFLHEKKII 
YRDLKLDNLLLDAQGFLKIADFGLCKEG 
IGFGDRTSTFCGTPEFLAPEVLTQEAYT 
RAVDW WGLGVLL YE ML VGEC P F P GD TE E 
EVFDCIRLHGRPPTPAFLSVQG\VKFIQ 
KLLQKCPEKPLGAGDQNAEE I KVQ? FFR 
PTNWQALLART IQPP F VPTLCGPADLRY 
FEGEFHRAA\ PALTPPAPHSLLTARQQA 
AFRDFDFVSERFLEP 


7314 


21015 


A 


7172 


355 


3 


GPLLSMLGMNILFLTLFLLLPLSMLEGG 
VCGNF/ I TNCCLE IDDNRKVIEDI TAKI 
Q KIiAH VP VQ I WKGW S P D TLFGGW F S S LG 
RFKTLVGIVLVILGVCLTLPCLLPLDVK 
NIQSANV 


7115 


21016 


A 


7173 


184 


2 


EKKVEDDDDDDDFEMESRSVIPAGVQWR 
NL\LHPLPSGFKPPSCLSLPRSWEHRCA 
PLHQCI 


7116 


21017 


A 


7174 


1190 


407 


S L KRR P S ARGS RPMS M YE TGSGQKP YLP 
MGEASRPEESRMRLQPFPAHIGRSALVT 
SSSSLPSFPS TLSW SRDE S ARRAS RLEK 
QNS TPES D YDNTPNDME PDGMGS S RKGR 
QRSMVWPGDGLVPDTAEPHVAPSPTLPS 
TEDVIRKTEQITKNIQELLRAAQENKHD 
SYIPCSERIHVAVTEMAALFPKKPKSDM 
VRTSLRLLTSSAYRLQSECKKTLPGDPG 
S PTDVQLVTQQ \ V J PVCVRPFAKAAKQL 
VTITTKENNN 


7117 


21018 


A 


7175 


3 


136 


AIHLASLKVF YRYRVLSL/ LKRLVSNS W 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso feu cine, 
K=Lysine, L~Leucine, M=Meth ion ine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PQVILLPWPPKVLGLQV 


7118 


21019 


A 


7176 


3 


316 


GIVKFFFFFFLTKCRTVTQAGVEWCSLG 
SLQLPPPRFKQFTCLSPPVSWDYRHLPI 
CLANFC I F \ VE TGFHHVGQMGLELLTS G 
YPPTSAPPDAE ITGVIHRTQP 


7119 


21020 


A 


7177 


529 


217 


LLPRLE YSGMVI AHCGLQLLGS SQAI LG 
ACYCAW/HNFKAFFVETKSHYVTQAGLK 
LLAS SNPPALASQS VG I TG VSHCAQATL 
PFKTVDKGPSFNLFSFTPPLL 


7120 


21021 


A 


7178 


3 


150 


YTLCKS KKD IKSIYHIMT/ HLKTHTHTH 
THTHTHTHKPSTTFSYTETFIM 


7121 


21022 


A 


7179 


540 


69 


ACRGAH VQ VGDGS ALT Y VE DGT ACG P TM 
L S LDHL CL P ALAFN FS T CTD S G S RR ICS 
HHGVTAWSLGWREKLTRGDRPLLTSPGP 
ALPIG\ CSNEGKCI CQPDWTGKDCS IHN 
PLPTSPPTGETERYKGPSGTNIIIDSIA 
GAVLVAAIVLDETGWGYK 


7122 


21023 


A 


7180 


452 


279 


CIDSIYQEIYHLSISILSVYLSIYRSIY 
LSI YLP I FW\SL YLGPRFLRTRDPQLLS 
LT 


7123 


21024 


A 


7181 


501 


183 


HVRCLAFICALIGFPSGYFSIMPWT/V 
DIi VG I E HL PNAYG III C VNG I S ALLGP P 
FAGWI YD I TQKYDFS F YI CGLL YMIG IL 
FLLIQPCIRIIEQSRRKYMDGAHV 


7124 


21025 


A 


7182 


446 


245 


IiRVR S LALNQ VS S AS R YWD TGVATKLAT 
F/SLFWKDRNLDILPSMVFNSWPQVIFP 
SWPPKVLGLQM 


7125 


21026 


A 


7183 


420 


73 


DSSDDSGQSKLKP/ FWKGFTILDAIKNI 
CDSLEDFKISLTGAWKNLIPPLMDDYEG 
FKTRVE E VTAD WE I ARELELE VEHGLG 
AVAHAYNPNTLRLRSLKPRSLRPGQHSK 

TLSL 


7126 


21027 


A 


7184 


567 


174 


FFLRDRVPSLTQAGVQWQDHSSLQPRPP 
RLKQSSH\PALLSSWD\YKHRPLCLAKF 
L I FLQRRGLAI / FARGWS NS WSQ/ CNL 
PASVSPSAGITGESHYSQPFLLSDSQTK 
VYKTHRSTISSTITEMGCERKD 


7127 


21028 


A 


7185 


345 


59 


RNGSSCLWQVIQSLTLLPRLECSGVITA 
QCSLGLVGSSNPPTSASC\VESHCFAQD 
GLELMGSSNPPTLAGITGMSHCAQPLYD 
LFLYEYGLSVHY 


7128 


21029 


A 


7186 


1 


209 


LS WLE IGSHSVTQAGVQWVHS S SLQP * 
MLGLKPSSRFSLPKCWDYRHEPWCLPGL 
LRLKFD P S VGGVA 


7129 


21030 


A 


7187 


3 


225 


E KT PVS DRGAKG C T E S L VNRR P C F SALE 
VDETYVP*EFNAETFTFHADICTLSDKV 
RHFLKP IALVI LNHHI F 


7130 


21031 


A 


7188 


1 


377 


ES * RNHNGDEDDSHVRAQGAC IEAKHDK 
YMADMDEL FS QADDNRKKRDI PDYLRGK 
ISFELMRQPCITPSGITYDRKDIEEHLH 

T*>tr/**'TTUT*\T*>Triinn OT>T rnTTTH/^T TTYMT Ti HAT/"Pit TT 

RvGHrDPV iK^^LlHEQLXPNLAMKEVI 
DAFISENGCVEDY 


7131 


21032 


A 


7189 


1 


359 


S RATDHVQRQGHG * G IHNYSDWENTDWF 
RETKNQS VPLS CCI E TVSNGNGS LAHPY 
DLYAEGCEALVGKKLLEFMMHVIWAELA 
FAAIQLLSMLSACIVMCRKSTDPAYELL 
ITVGTYA 


7132 


21033 


A 


7190 


496 


118 


QNQKFP FE PGTS * LGL YPKKYKL I YHKD 
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C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K~ Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














PKTCMFTAALLT I EKP WNHHKHLSMADW 
IKKVWGTVGHACNQSTFLFWFFFEMES 
HAVAQGGVQWLNLCSLQAPAWATERDSS 
QKRPKIDPHKYIVN 


7133 


21034 


A 


7192 


1337 


342 


ISEATRTFRESIWQFQSSPREDDRNRQL 
S AS GSRQSHPG PR S TPG PG QGHRS S HQG 
CAPVSSAARPGGPSFRLSEGALSRSQFS 
VLVYI P * VAAGQFVKACLGAS SPPSRL * 
GGLGTGHRTPGKEARSWPETSAPPAWHL 
LRSLCLPSRGEKLAQARSQKHLGWKWLS 
KLCPKTKYSWSGPDDSPSVLRDPEAGTT 
RWAPSGNPEEEWLQLKAKYLQNISEVL 
SVLHLLHIKALQLHSTNPFFVNARHLLV 
GQ IS SALTLNNLSQILFLLLRFPFLSQV 
ERHSPLQKTGTTETNLVLFQDTVPMNFP 
SS ILLPSKYSCPALDKEASHFYQ 


7134 


21035 


A 


7193 


469 


130 


S AS WD YRRP P PRPSNFLY * * RRGFT I LA 
RMVS VS * PRDP PAS ASQ S AGCAYS I YLQ 
ER FARLHP ACLAPR YMAS D P L S S LHHS V 
PPGLHLEHEHWLWHCLLPGQI CRPGKLW 
Y 


7135 


21036 


A 


7194 


430 


149 


TWQFLKQLNIELSCDPVTLLLGICPRE 
LKT Y VHT KTRT * AF I VAL CT I VPKQKQL 
KYPS VDEWVI KMWYVHTMD YFS VI KKNK 
LAKHRGSHL 


7136 


21037 


A 


7195 


257 


412 


TATSNKEPLKKVRCLIWRKKDCWARGFT 
ACNPSTLGC*GRWIMRSRDRDHPG 


7137 


21038 


A 


7196 


1 


124 


PL I S GWHP CNP S T LGGRGWR I A* VQ E F 
ETSLGELAEPCLY 


7138 


21039 


A 


7197 


2 


120 


YGI ILFITSEVFVFAGFF*AFYHSSLGK 
KISKKESVIIY 


7139 


21040 


A 


7198 


234 


416 


GVHAE VL I S PGPQDRLRALWI P 1 1 FG I 
LFAI LLVLVF I KKVAKKPTNKVGHP * E P 
GTEF 


7140 


21041 


A 


7199 


372 


3 


VHYPVPQTGSPCCVPHPRGLSTIHPGLH 
SRYTGLGMVAHACNCSTLGGQSGKIA*A 
LEME I S LGNMVRP CL YERTNKQS KPRHM 
GITFGLTFSLTLTDKYSLI ISGFYLPQV 
LISLHPLGPY 


7141 


21042 


A 


7200 


396 


21 


GLKESHKCCAI PACLQRD PLVRVLGKAG 
F*RKSLSISTELQSLFDSPDFSKITGKP 
I KLTQVEHRAGFEWNEDGAGTTPS PGLQ 
P AHLT F P LDYHLNQ P F I F VLRD S DTGAL 
LFIGKILDPRGP 


7142 


21043 


A 


7201 


2 


148 


REPPHHSLANF* KFFVDKGFTLLPRLVS 
SSWPQWLPPRPPRELGLQA 


7143 


21044 


A 


7202 


1 


291 


AEGFRMHALVS QAAGGTS GYKL S VGYMV 
QGPL THLMLGLL I S L I FLSRGS GRTWAF 
SHS * YKT S DLLP CRNRRE V I E FLH YRNL 
HSHICLSVTKTFL 


7144 


21045 


A 


7203 


262 


406 


ILARMTSWNS IKHLILFLK* SSHLS X P 
SSWNYRCVLPHQANFLFLFL*R* SFTLM 
PRLVETVLLIQNSPTKIDSSPDDFSGKF 
YQTLNIVFENIRRGNTSNLLHKNSRILI 
PNLTKTLHD 


7145 


21046 


A 


7204 


2 


247 


•DTVLERSGQGTAALPRSTSLDRDWAITF 
EQIIASLLTEPALVKYFDRPECMMARIT 
SAKNIRMMGKPLT* ASDYE ISAMTG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S— Serine, 
T— Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "—Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7146 


21047 


A 


7205 


2 


168 


FFFLVETGFTVLARMVS VS * PRGLPASA 
SQSAGIAGVSHRAQPALRFLMLENRMC 


7147 


21048 


A 


7206 


1 


237 


GKCAHTHTHTHTRDTHRKAHAHTHACTH 
THT Y VQGC S L S QWS QD C VRMWACH I FGV 
PRE *ASRYTS SLVYKPGCTGVCG 


7148 


21049 


A 


7207 


2 


239 


KVDSEHKLERSSGGVLGRFKGKGSGGLN 
LGNFFASRKGYSRTGFDRLNTEGSDQDK 
EDDGSESE*EYS*PMPALEPCSS 


7149 


21050 


A 


7208 


1 


405 


GSRDNMNVILICFPNAPECVSAESVKNDA 
QLDKYLECTAQEI I *NHWEDVPDIjFHVM 
RTLPSDNMPSLiPPGGELASMRTVIEAVY 
YRLNPYINDDTDSTSTDDMWFNCSSSHG 
VYLDLHRRVQVNFDETFSMHPELi 


7150 


21051 


A 


7209 


2 


761 


QDFGTRFATVLLFYGYFKCTRTLKGTCL 
YNAILYKVCSPRNDQPDVCYNPSFFLLP 
IKTGELLGFLVYASREKRS IAIGD*KDD 
KWPPERI IQYYGPAT*AQDSSWGYRIP I 
YMLN* I IRLQAVLEIITOKTGRALTVLA 
LQE I QMRNAI YKS RLALD YLIiAAEGG I C 
GKFNMTNCCLHIDDQGQIVEDI IKITKL 
AHVPMQIPDIVWKMIVKIPVLFCSVLIT 
GTCSPQSGTPCWLSRSRPSHVDPFRWS 
P 


7151 


21052 


A 


7210 


147 


390 


KTGLSDQMPRAIQLSYS YLS PLLY* * S Y 
KKKKKKKKKKKKKKKKNFFKKGPGN* SF 
WGGEKKSGPGAGLKNPGGGKNTFY 


7152 


21053 


A 


7211 


433 


214 


WGRGJSrALTIHPPWVPFQESETTPSPL*G 
ITLLDNPFSPIPKQPPPPH*SITVPYLL 
SSPQPGFPHPFQAPGLA 


7153 


21054 


A 


7212 


532 


253 


KWINVFHKAA.IMSDFSDRSRHSTLKAS 
E*GFTMIDAIKNICDL*KEAKISILTGI 
*KKS I PMDDF* GFKTPML* VTANVGF 
QRFSKKFYW 


7154 


21055 


A 


7213 


456 


24 


DAQAFSWYSQGYFNSSQGFS ISLPGDTS 
P VKI KRMGGP LFF S * LLKS TCWAGSKTS 
RVPGKVLKTGLPRPR* PVYKDPLGGFYP 
RF 


7155 


21056 


A 


7214 


2 


445 


ILRDLCQRVPTWSGFPSWAMELLVEKAI 
S SAS S PQ S PGDALRTVFE CISSGII LKG 
S PGLLD P CEKDP FDTLATMTDQHRED I T 
SSAQFALRLLAFRQIHKVLGMDPLPQMI 
QRSNIHNNRKRRRDSDGVDGFEAEGK*D 
*KDYDTF 


7156 


21057 


A 


7215 


203 


3 


VRPGWAHACNP STLGG*GGRI A* SQE F 
ETNLGNTVRPCRWSGEVLNRKTVSIRRA 
LRVFSPLHTA 


7157 


21058 


A 


7216 


1 


285 


GKCAHTHTHTHTHTHTHTHMHTRTHAHT 
HTHMCRGAV* VRGARTVCTCGSAP FLE S 
PGKEPKDTLRPLFVSQAARAFLGEKAFC 
SVYRQREPPPI 


7158 


21059 


A 


7217 


1 


386 


GUh>XvL.JijWh> VIjY VI 1A1 i'GlGv A±jGILSW 
TVI CMLKRQNGKPKMKSKYKNLDATDQE 
SL*LKPASRAGIKQKGLLLSSSLMHSES 
KLDSDDAIFTWPDREKGKLLHGQHGSVL 
NGQTPLKATSPREEIL 


7159 


21060 


A 


7218 


70 


379 


NILYFNIFFFFK*FLKFLWVFFFFFFF* 
VFFFYNFFFFVFFLCVFFFFFVKLKIFF 
WGPFFLFFRFFFCLGFFGTFQSWRWFFA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L~Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LFPWGLMFLFPLLFFLVFF * LVFF 


7160 


21061 


A 


7219 


1 


208 


PRLAAS VS PGDMLGMQVLGLGTVAS ACN 
C S TLGGRGMG I T * G * EFETS L VKKARTC 
FYKRLKRQKSANL 


7161 


21062 


A 


7220 


455 


96 


HYLYGSQGLSRFYFVTQGRVQWHDHGSIi 
QPPTPGVKQSSSLSLPKCWDYKCEPPCL 
ARPCPFFFFL*TSSVDTGLLLNQQPIL* 
IMTFPYPLLRKSLSFHHQIYEPTSICSH 
LLSFCYT 


7162 


21063 


A 


7221 


201 


319 


I YELWPGPVAHACNFS I LGG * GEQIAQV 
QEFKTSLGHLG 


7163 


21064 


A 


7222 


307 


3 


NRRQAG P F PAKL.GR P S P KGG F PN F FKL F 
FFKSSF * KNPKGQGWGFPPLI PG FQGPQ 
VGGSLGAPGLKPPWGTPQNPFSKKKKKK 
RSNCTDTGTTIAHRRQM 


7164 


21065 


A 


7223 


331 


141 


SSTITANVSPNIi*DANDVPIQCEISPLI 
S YAGE GliE R YVAD KE FHAP V I IDENGVH 
EWKNGI 


7165 


21066 


A 


7224 


157 


314 


WLPGFRDGGEDGVCHVAQAGPKLRT*GD 
PSIKASQRAGITGLTHRTGPK*KF 


7166 


21067 


A 


7225 


2 


392 


RDVRSLQPLPPSLK*FSCLSLIiSSWDYR 
CAPPTT*LI FILLVEMGFHHIGQAGLEL 
LTSGDPPASASQSAGVTGVSHRTRPPSF 
VLVNPIPPLLPPVPSPDGPNISSSTTSW 
RGYPEVLRWGLCSFWGCG 


7167 


21068 


A 


7226 


84 


204 


FIFFFFKR*SLTMLPRLVSNSWARVALP 
PQRPKVLGLHV 


7168 


21069 


A 


7227 


3 


307 


DRAAPRGAGRAGAVGPHRTGDWGIAGTP 
AGRGHARH I GRG P KQGRAAG QRGHLHP G 
GLLGRGGNSEP * RKQRTPGSMRRAAPLL 
STGQVWEGPASVGDRVP 


71.69 


21070 


A 


7228 


52 


1176 


GS QQ P LLH I S E AQGDTL I KKAP AE AAWP 
PLIAFSSGGRLPCSHES*SAAGYKRLDD 
TAAGGPPSPGGSPRSPATAPGLHPVWGG 
QLiRPRGGRGLPATGSGEAFQALTQLSRG 
RACTGPPAPGSEDAAPSGCSAARWSP*M 
AGSSSCAIPDSAASCICPQGPPWRW*SR 
APARNRCSASSGRSRC*QLRRRRYIPGL 
LHPVLHEALPDS WGCE TQLGPPLAS EAA 
PPRSRRREEG*RGSGRC* *KPSLPRSWS 
L CRNQAE AGF VS I LVW L S GLAGAP S RP A 
AAAPQ I APNQHRS P P PRLAPVPAASGS P 
WP WERDLG VLETLNVGNFAQD PNPS ACS 
GPQSTPDTYYPVQKPKTGLRLHFHSRPN 
TIVKSVGMMCL 


7170 


21071 


A 


7229 


3 


112 


ASNLGSRGCSEPRPCHCTPAWV*E*DSI 
SKTNKQKN 


7171 


21072 


A 


7230 


103 


299 


AMS Q I RNAI P FT I AT * RL * YLE I QLTKE 
AKDI YNENYKI PLKE IRDVTNKWKNT PC 
S*IGAQIiCK 


7172 


21073 


A 


7231 


2 


137 


SLTLSLSICFCL*RWGLSVLPRLISNSW 
PQAIFPPWPPKVLELQV 


7173 


21074 


A 


7232 


3 


273 


LTHIVDGPNFLTTMYQCIjMCVC*QELGY 
QP VTTERCHNVCNDCLQRS FKAQVFS CH 
ACRHDLGQNYIMIPNEIMQTLLDLAFPG 
YSLGR 


7174 


21075 


A 


7233 


338 


136 


RPRWEDCLSPGVQNVCGQHSESPFLQEI 
* KI S QTWWCKP WPS YQEVEGELLE I RG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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NO: in 
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09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














STCFGLPKCWD 


7175 


21076 


A 


7234 


437 


261 


HLGFP E C WD YRHE PQHPT* FLEHFM * RR 
GLTTFPRLVWNSWPQAIFPRWSPKVLEL 
QT 


7176 


21077 


A 


7235 


41 


264 


VTVME I KGKI NLVLD 1 1 KD I YVEGFS T Y 
YF*RQGLTMLPRLDLDSWAQAILPPQPP 
E*MGLQECNHHAQLSRYF 


7177 


21078 


A 


7236 


1 


154 


GTRDFS VKTL *ARREWHDI FTGMKEKNF 
YPKKVYALKI PFKNEAETKKVE V 


7178 


21079 


A 


7237 


1 


179 


SCLSLSSS*DCR*PPPCPAUFVKSFFRN 
GVLTMLPRLVSNFLPQANLLAQPLSGFI 
ILL 


7179 


21080 


A 


7238 


340 


140 


KINWVWWPVPVI PVTPEA* AQKLP* PGK 
LKFPLT* IPPLPSRLGHQSQTSFPKKKK 
KFYILSSDSV 


7180 


21081 


A 


7239 


2 


175 


QFE S KP S P AWATEQDPVS KKKKKKKKKK 
KKKMKKKNS SKFKM I PFTKRQ Y * EGNWK 
N 


7181 


21082 


A 


7240 


336 


192 


QSE TPS QSNQKF I F * GQG I ALL PRLVLIST 
SWAQAILPPWPPKMLGLQA 


7182 


21083 


A 


7241 


1 


186 


DRPGQHGEMLSLL I FCFFCRDGGLTMLP 
RLVLKS*AQVILPSRPPKALGLQDTVHC 
SKFDAV 


7183 


21084 


A 


7242 


2 


111 


VKTLNKLGIDGTWLRIIRAIF* KPIANI 
ILNMCSFL 


7184 


21085 


A 


7243 


360 


217 


LP S RGDYKRKP P C LVNF YFL * R * GLT LL 
PRL I SNF WAQM I LL PQS PKC 


7185 


21086 


A 


7244 


35 


276 


VQQQRVKQGRKYYS IVLKNKKKLCNYLR 
VGE*MG*RNIGWVQ*VAHTCNPSTLGGR 
GGRII*PWEFETSLCNIG*TCISG 


7186 


21087 


A 


7245 


1 


356 


IQHTFSLLIFGIKYSYYVLKIVSWLAAV 
AHACNL S TLGGRGGW IT* GHELE TNL F F 
AVKPWVYDKNSFPPTLNLRFAKSGHLWE 
PRGKNHGPPETPPLSRVS PPQRFFPGGT 
RPLKKE 


7187 


21088 


A 


7246 


3 


293 


CKCGKAFHNF YS FQNHE I S HTGEMLYD I 
RNVGK* S IFF*HHCQHKTTHRAEKLCEC 
KICREAFSHFGNLKVHEAGRSGSRL* SQ 
HLGWAWCLTSGI 


7188 


21089 


A 


7247 


6 


387 


AS I P CLT YPS WE F L * F I * QAGH F S FS KK 
KLFICAFFSNRDEVSLMLPRLGLNSWAQ 
WLLPWPPKVLGLQVRATTPAKLGIFWG 
FMSVSLGQGVCLCRSLLCPLPLGPGQVH 
SGCSQS FVEKMNKEL 


7189 


21090 


A 


7248 


373 


146 , 


HLGKKVLIFWPQYLALLNPPNGGVSRVY 
PRCRPLMVFFFFFFFC*GRGLAMLTRLV 
SNSWSQVI FQLQLPKVLGL 


7190 


21091 


A 


7249 


2 


184 | 


AILRGX*KKFISTLMDDFEGIKTSLEEV 
TAD WE IAREVELEVEVEPKDVTELLQS 
HDTT 


7191 


21092 


A 


7250 


795 


544 


EMKSCSCCPGWVQWRDLGSLQPPPPRFK 
RSPAS AS LVAG X TAACHHARL I F VFL I D 
TGFHYVG* AGLKTPEA* ATAPGP I FLYF 


7192 


21093 


A 


7251 


34 


242 


KNTSLKKKFFWCQKLRNVSLVEEFAFWP 
GAVAHACNPSTLGDRGGRVA* GQE FQTS 
LSNIGRPRLYRKI 


7193 


21094 


A 


7252 


216 


57 


NIQVLKYVLTSYLKKKKCAFRLGPVAHT 
CNPGILGAQGRRIS*AQEFETSLS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PnenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
i— j. nreomne, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7194 


21095 


A 


7253 


126 


366 


INLTECQQYLDKAI FFFFLKKGLFAP * A 
E WQGGS FAS LQ P KPPRLKKS SHLTLLS T 
WDYGGALPCRANFGIFFFWKKGG 


7195 


21096 


A 


7254 


2 


213 


HP 1 WAX WyJNiPAb I KKK.KIN b b b b a xh,b RT 
VAQAEVQGGNLG * LQSLPPGLKRFS CLK 
MPEAPKGQSCSLIG 


7196 


21097 


A 


7255 


150 


263 


EGVSLLPRLACNGAILiAHCNLRLPTSSD 

T ft 7\ P C* ' L " 1 It 1 ' LIVWtrCT'Vn'PPTT'nWT IV/tT T" f~* 

ijCJAr b* i. I lKixJ\.Xbbitr bHPNIjMJblC. 
SRFTLIiSSHSSLGHLTLFLLIVCFEKES 
H S C P GWRAM VLS WLTATFAS RRQAT FAP 

CJT T7T OTD 

bJbiiijbK 


7197 


21098 


A 


7256 


386 


93 


P PGGLP PQS IPS PRAQLKPKS PRGKKRG 
GPKPGGSQPGFWGFFFGPGPPFKRGGGL 
JN WV JjCjArUN * iSXj V IVKIN y KK-ilKOj J\Jyl Py £1 1 K 
RGNRS IDMKFGGS 


7198 


21099 


A 


7257 


153 


369 


KENSRLQGYGPLFSHS* FTNSCPGQVQW 
P VXiaALVViiAiii V(jCjbKuyP<jyHCj.KrPR 
LLKI Q KFARCGGGRL 


7199 


21100 


A 


7258 


180 


3 


CSTLI * IRKWLGAVAHAYNPNTLRGRG 
GRIA*GQVFKTSLGNNVKTCIiFIiPS PHN 
QQ 


7200 


21101 


A 


7259 


22 


265 


QLRAIPDLKLDPL.MEGKEILSRTLLGSG 
TVAP I CDPS TLRGQGGW I A* VQEFKTSL 
(jJN!VlAJxPHijiKKJK.i\.ijijoy V 1 is.J_ii!»xK.W 


7201 


21102 


A 


7260 


326 


2 


KSAMQ I CI VDPLNAFCLE I CCHANLHCR 
PSNAFCFLKVDNVRAFVS P S I PQVS L I L 
KNFSWPGWAHTCNSNTVRGQGERIA*A 

r"AT?"E'Ti"T , TT f'NTT , r 1 T5 T3 fT VUT TTT\TT\T"D TTO 

y\c»r Hi 1 JLIA3JN lUKFL.JuxrbAJ>li\K]\o 


7202 


21103 


A 


7261 


1 


68 


VNLC F I LWLGPGWAHACNP ST* GGRGG 
RIA*P*ELETCLSNTA*PGWAHACNPS 

rp 
1 


7203 


21104 


A 


7262 


282 


176 


GLKKTFFFLVRERVLLCYPGWSAW* SQ 
LTAN* TPGLKLFS CLSLPCS W3STHRHVAL 
HVAENI C* CHMP WPATWEAEAGEQLE P 
K^jo V l^oJilj 


7204 


21105 


A 


7263 


588 


351 


NNT VCLLES I S S VNLCLPHLQHS PVKLE 

TT?T,T?'C'C7C!Q TiM U 1 "D C TUTU T T\ TT> "D T CDC •* 7\ T 

IFPVTCTYTSRWPEATKDPQKK 


7205 


21106 


A 


7264 


327 


2 


KI KER INNFI FI CRDRVS I CCPGWNAW 

rnj .tvTQTaTAOZI TT3WPT.CT.WIQC * mvd r^'A Orr 1 U r~"* 

QVTT I FFCRDQ I E IHMVP KYHLGNTLNY 
FF VLFCFVLFLRHP FSAHRNFRLP 


7206 


21107 


A 


7265 


121 


270 


AQEAGFELLTSGDLPTSASQSAEITGMS 

trPaT.TJQr'V* "E , V^7T , T.TTr^.T. , T 1 ^7'2N. (~*7\ 


7207 


21108 


A 


7266 


3 


227 


S S Q VS L PNNWD YRRE P P CLRF VFLF F VE 
TRSCYVASLELLGSGDPTVLAS *SARVI 
DMGHCDWPNLSEILLHVR 


/ZUo 


111 Aft 


7\ 


7267 


300 


1 


KiYlijijJN 1 XiYlL-JNKlilijL-iSi^AbPRVyylbJt 1 V 

SYFKKLPQPPQPSATTTLISQQPSTSKQ 
DPPAAK*D* FAEGRLIiAFPSNWLYQLR 

VTMr'TH'T ,PT-TTnT\7TTT .\7T^[ 
x J_J. v i\_.iT i-j Jr n ± \j x v ri-Li v in 


7209 


21110 


A 


7268 


29 


344 


ASLLALKKKSGLGAVAHAHNASTSGGRG 
GR IT* AQE FAT S LGNMART CL YKKKKKK 
KKKGKNFTGVGGPQRGAPI FGPWGGGKS 
PAPGKPGGPNPFPGKGAPTLF 


7210 


21111 


A 


7269 


3 


209 


LHPLSPGPPAPVAAGIPGPQASR*GPCR 
PDPSPDSASVDASLPGPALSLWSWGSNR 
GEYGTTPASQIH 
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SEQ II> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E-GIutamic Acid, F=Phenylalanine, 
G==Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M— Methionine, 
N=Asparagine, P=Proiine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaJine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /—possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


7211 


21112 


A 


7270 


86 


206 


CQPG WAHAYNP S T WGGRGRQ IT* GQE F 
ETSLGNTARPPL 


7212 


21113 


A 


7271 


45 


274 


GLGTVAQACDNPSTLGG*GGWIA*GQBF 
KTINNDKSNKWQYLAIIEQINNRMLICK 
NKKELTD I CNNTDE FQKHRE 


7213 


21114 


A 


7272 


1 


122 


VL VE TG YLR VG QAGLKLLT PGD P PALAS 
QS AG I TGI SHRA* SYMHF * LVETGYLRV 
GQAGLKLLTPGDPPALASQSAGITGISH 
RA 


7214 


21115 


A 


7273 


3 


242 


LLKQLD YRPPHPAKFFFFLE FLVKKGFH 
LWKGG F KTQT S GGIi P G * AS KG VG I PGV 
S HCPGL I G VFNL TIi KVAVS LMDL 


7215 


21116 


A 


7274 


2 


169 


IiALVAQAGGQWHDLGSLQPLSPSFPQFF 
YLSLPS S * DYRHE P PHS AWFS F FSNYR 


7216 


21117 


A 


7275 


353 


76 


FAHLGLPQCWDYRREPLCAT*KLFLKGK 
* PSKFQVI FSSLGTGHFSNACFS CVMWT 
IVLLI INFWTGIDGRTSGSSSVKCGNH 
SIYPMVLL 


7217 


21118 


A 


7276 


6 


87 


RRGLTLLPRLGSNSWPQAILLPWPPKVL 
KL*AQTPGLRQSSCLGLPKC 


7218 


21119 


A 


7277 


1 


203 


FQVKMLCSGWMWLTPVIQALGAVAHTGN 
PS TVGGRGGQ 1 1 * S * * FYTS LDSGMPPA 
SASQSAGITLA 


7219 


21120 


A 


7278 


272 


51 


IFFPW* CGGHD VQHHFNP S *NNLSSWS 
S WFMP 1 1 L PL WEA* AGK* LE PR S LRPAW 
ATKLDSNSKKKIIEIHA 


7220 


21121 


A 


7279 


1 


131 


SRDYRHGPLHPTDFVFFVEMRFHHVAQA 
GLKLLGLSDLPALTS * SRDYRHGPLHPT 
DFVFFVEMRFHHVAQAGLKLLGLSDLPA 
LTS 


7221 


21122 


A 


7280 


411 


36 


FLECVPNFFPPKKKFFFKKNTGQVFFPV 
TFL*KGPALFFFIFPFYGKKFFFLSSAK 
DFPPRVFF*KGPLFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFGFKHFIYKKVHF 
*SSVHFQPIIFF 


7222 


21123 


A 


7282 


395 


181 


RKHES IHHDSVISVTFNMQSLRLFFFIF 
FILSL*LFRFIFFFSFFLVFFFFFFCFF 
FFFFFFFFFFFFFFFFF 


7223 


21124 


A 


7283 


392 


2 


IKFFGYOT*FNYILKNGLF*QLLI*LIN 
LFILRQGLIFWQVCNVLVFFFFFFIFF 
FFLFFFFFFFFFFLLFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFLTQNMLFIVN 
F C KDT F KQ QN IHLRTRGR 


7224 


21125 


A 


7284 


113 


291 


YVCVCVCVCVCVCVME Y* SAI KRDKIVA 
FTVTWMELETI ILSEVSQEWETTHHMLS 
LIN 


7225 


21126 


A 


7285 


327 


44 


LIKVSPNFSPPKKKVLPKKPPCTIFPPS 
PF*KTPPEPIFIKPPHRKKNFNSPPPQK 
*GPPQGFFKSPPPFFFFFFFFFFFFFFF 
LGFSSKLLIR 


/226 


21 127 


jf-i. 


7286 


3 


215 


V I L I X FM I * EAFAW KRKV L I V E EPS INL 
E * L YD GG Y YDGLVLTS GLGP Y 


7227 


21128 


A 


7287 


367 


119 


LKKAPAPPFLNPPFWGKNFILSPPEKMG 
PPRGFFKRGPPFFFFFFFFFFFFFFFLG 
FFFFFFFFFFLQYN1NJMYFH * IL XINQ 


7228 


21129 


A 


7288 


73 


259 


INKYLQN* KKKKKKKKKKKKKKKKKGGP 
F KKT PRGAQ I S PGVKKKNF P P KGG F KKK 
QAPGPF 
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nucleotide 
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SEQ ID 
NO: of 
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M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysme, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 

ITS 7 T* 1 1 t r nr> • 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, -=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7229 


21130 


A 


7289 


337 


53 


IiRS FAP FHLP I NNLLTD I HM V I P LP P S S 
LCSHYIiFNEVTVTTLFKTTTHLQLGTMA 
HTCNPNTLGG*GGRSA*AQESETSPGNM 
AKPRLYLLNF 


7230 


21131 


A 


7290 


1 


96 


KHHFGFEAAA* YWHFVDW*LFLiYVS I Y 
**GS 


7231 


21132 


A 


7291 


288 


2 


S R VE FKGP FMAHCS VT I P GS S YVFTS AS 
* VAGPHGARHHAC* FLF*FLV*MI»LEMG 
S RHWQAGL * RVS S S D P P ARACQ S GG I P 
GVKPHAQSVSV 


7232 


21133 


A 


7292 


121 


233 


YFPKKKKKKKKKKKKKKKKKKKKRKKK* 
KGGGRSRKSYLG 


7233 


21134 


A 


7293 


3 


346 


NPLNPLVNLiTVS PKRNSSLDTRKKPCRE 
RHREKETLRKKRNQHTEARGRQRENHPE 
TE*ETGQRDRDKQTKKQKDTDIQAYTYT 
DRE KRD KKRHRQ I R * AE AE R I RDTERQR 
DI 


7234 


21135 


A 


7294 


46 


346 


ETGLKKWGDLSNGTVRAVLNLAQADFCV 
SIPS YH I KHGNT KNN CGGEGKKK I LWT 
AKKS EPG WTHACNPNTLGG * GGRVT * G 
QWFKTGLGNIVRPCST 


7235 


21136 


A 


7295 


254 


12 


RKDAENNGSRAE * I KRVKDLVQS KLKFF 
QELEKEGILPNPFYEASITLIPKPDKDT 
S KED YRP I S LMN I DAKILNKI LAN 


7236 


21137 


A 


7296 


258 


1 


C I GQ C VLVAVGRE LL C VSRS W CGLLHVD 
EVGGEALGRLLWYPWTQRFSESFGDLS 
TRDAVMGNPKVKAHGK* VLGAFS DGLAL 
LD 


7237 


21138 


A 


7297 


191 


263 


VCSKCE P F IYIMDYYAAI KRNKI MSFVA 
RWVQLE WSELTQEQKTENNMFS * WFG 
FLLLC 


7238 


21 139 


A 


7298 


2 


164 


S RVAG I TGAHHHAQL I F VFLVETGFHH V 
GQAAFKLLTSNHHARPQ*LHFFYFFK 


7239 


21140 


A 


7299 


307 


8 


VMIITFWLPHLSGYIEKSTPYECGFDPI 
SADRVPFSIKFFLVAITFLLCDLEKALL 
LPLP*ALQTFNLPL,IVMSSLLLtI I ILAL 
SMAYE * JjLtKGLiDCS E 


7240 


21141 


A 


7300 


205 


317 


KSSPACTPSTLGG*GGWFA*VQEFETSL 
DNMVKPQLY 


7241 


21142 


A 


7301 


140 


22 


FNFLFLYP *NYFFFFFFFFFFFFFFFFL 
FFFAMNINVFY 


7242 


21143 


A 


7302 


382 


151 


SIHRSEFRPRRSHSVSRPESRGIILVYC 
SLYFLGSGDS AP WS * VAGTTGVCPWAS 
L I FGFFFFFFJaTRDG VLLS CPG 


7243 


21144 


A 


7303 


318 | 


17 


LVPTKRPQVENYYPVHTYIWYCTKGKRK 
WAGIGGSCLSSQHFGRPSWQDPLSLGV* 
DQPbQHRfi I PbLQK* KNWWQE P I V PATQ 
EAEAGEWCEPGRQSLQ 


7244 


21145 


A 


7304 


310 


132 


QWCDLGATTPANFC I FFVRNGVS P CWSG 
WS * PPDLKLS ARLHFQSVFFNP IIS IGL 


7245 


21146 


A 


7305 


338 


168 


REPPLPAANFVFFVEMRSHHVRQAGPEP 
PSSSDLPASASQSAGITGVHCHAWP* *T 
YK 


7246 


21147 


A 


7306 


276 


161 


I INAAI KKWEFMS FAGTWMKLEAI ILSK 
LTQELKVK* KLRKFNTDTILLLLL I FKI 
F 


7247 


21148 


A 


7307 


310 


61 


ITIiVYCNFFYFINFNFLKLFDSFVIXLS . 
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NO: of 
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sequence 


M 

eth 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-lsoleucine, 
K=0Lysine, L— Leucine, M=lVIethionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R— Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














LKQIYSCTAIVYILIL*AFVQFKFLCFY 

TTT VCVTiTT ITTVITNTT TTWT'K'V\7*'KrMV'U'V"I? 
J? LjX r l\vi 2.C JL X J5JN J- r jl SXJ.iS.1 V ^lNlYl X c x V 


7248 


21149 


A 


7308 


257 


1 


CGRVDLH*AVFFFFKRWSLTVLPSLDLD 
SWAQVILLP*PPE*LGQQSCTTPPSSLS 
T FL YAFLH I P VNKM S C F VGKDRGMF FS Y 
H 


7249 


21150 


A 


7309 


2 


181 


RDGFCHTGHTGL*LLTSGDPPASASQSA 
GITGVSHHARPCLHSLTAFFKNWMVWPG 
QQE 


7250 


21151 


A 


7310 


25 


340 


FC VFNTMDVKNI * I FLI FVPDVHLG * QK 
NLLLDT I * KKKS IRIQE**QYKSIS KRN 
YNTVS DKS WLGTVAHACJSTP STLGGQGGR 

mTv -t- rn/* , \'CT"ni/~\rric'T TMvTmx 7"n n /""it T~\r\T 

I A* lQhir QIoJjDINI 1 VHFLvi x 


7251 


21152 


A 


7311 


2 


193 


FVDFSMHILDMLSLYLSIVICYITLFLS 
J. J-DJN 1 Y VJnCjPHK-L IlFYl? x LlKi Xr ill 
IWFCFWFFETVLLCCFSVFTFNDYCSYL 
HYFLLLGVFGLFHFF*FLYLYMVLFLVF 


7252 


21153 


A 


7312 


1 


245 


RTPGLKQSSHFSLLSSWDYRHVPLHPAN 
AFFFFKGP*SEPLTLASSLHWYFPGGHF 


7253 


21154 


A 


7313 


167 


2 


GKPCLLWMCRDSQKCGEKGTGLKREAL 


7254 


21155 


A 


7314 


334 


152 


CHVAQAGVQWYNHSSLQP * PPGLR* FSH 
LSLPSGWNHSWTPPHPANFFIMGSCHLP 
LINK 


7255 


21 156 


A 


7315 


2 


164 


VSHHARPTFYLS * QVGLTVTS FRNPQ 


7256 


21157 


A 


7316 


155 


9 


QRIKH* FHWLGAVI LTCNP S TLGGQGGW 


7257 


21158 


A 


7317 


1 


293 


GIPSTWDYWCTS PQPHF*FFFFFF* * KR 
DijADaijPGGVXjJNo W 1 PAIXjQ P KGPKa/IjGL 
QAWTI L YKGYYGKEMWHKE I CKGTHD VL 
SKLLTTKRFKRLL 


7258 


21159 


A 


7318 


262 


73 


S N S L V YLi I KPD S VY I FLL F VE T VS S HV A 
QVGLELVASSSTPI*ASQSVEITGVSHH 
ARPKKVT 


7259 


21160 


A 


7319 


169 


265 


GGRMALSPGRGGCSEP*GCHCTPAWETE 
QNPA 


7260 


21161 


A 


7320 


169 


260 


KPQKI P VACDQQ FTDKRNCS HGD YE CNM 
VDFFFFFFRERVSLCHPGWSAVV*SGDH 
TTALQPG 


7261 


21162 


A 


7321 


580 


400 


FFFFERESRSVARQPGVQWRDLSSLKSP 
PPG F KRF P CL S L P S S W DYRRAQ P CP ANV 
FEFW*RWGFTTLARI VLLS * PCDPPTS A 
S QS AG I TG VSHCAQ P * FF F FE RE S R S VA 
RQPGVQWRDLSSLKSPPPGFKRFPCLSL 


7262 


21163 


A 


7322 


130 


26 


SPGPLFGLIEPLPSRFKGFFWLNLP*TW 
E YRLAL TAP I KFFKFL I KTG F PH * G I KL 
RG 




1 1 1 


A 


/JO 


1 7 
1 / 




FRSQF * YLKKQLQPGMMMHAYNPS YS KG 
*SRRIT*AQEFKSRLGNITKLCCLKINN 
SLIFKITENIE 


7264 


21165 


A 


7324 


194 


8 


D WVE VL KKS T YRP D T VAHACN P S T LGG * 
GGS IT* GQKFE IRLGNWGP CLKKNLTN 
QLGVGG 


7265 


21166 


A 


7325 


413 


72 


GVP I FLKTLTGKTI S SERESKVPLEKGK 
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nucleotide 

location 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C~Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M— Methionine, 
N=Asparagine, P=Proiine, 
Q=GIutamine, R~Arginine, S=Serine, 
^Threonine, V=Valine, 
W—l ryptopnan, Y=lyrosine y 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ANFRDKRGI PPHHKRVI FAEKKL*KGRS 
PFDDGFQKKLTQPRVSAFKGHPPLQPLY 
At IKbr KiLLjlHHQDCP.b WHIiUjIiSARRQ 

Q 


7266 


21167 


A 


7326 


394 


177 


NVNHTHTHTHTHTHTHTHGWCKSDTGYS 

bDb^Cx, 1 FKCxrtUlr 1 Wirbbby X ViNbAJYlG 
LTLAKEVXiANMTQTEV 


7267 


21168 


A • 


7327 


232 


22 


SRPSGSKKGAPALQQSSCLSFLSSWDYR 
HKPPYx7rNFCIFL*SQSFPMLPRLLSNS 
QPHSNLPKWELQV 


7268 


21169 


A 


7328 


232 


3 


YHRSSGSLISLIQIRLRSKI *APSIPII 
KEWLS S VAHVYNPNTLGGQGGRLT * GQE 
FET S LGNI ARP CLHENCLKN 


7269 


21170 


A 


7329 


364 


3 


PKARRATVRRTRVS KAPSNRQCLGVNTH 
VLGHRRP VKGNAKTRWSASE SQAHTCNI 
T TLGGQGGL IT* DRE FES S LATMQNP VS 
VSGWLQKMFSKQKLYSKFFPNVAYKAFR 
IWLLCARA 


7270 


21171 


A 


7330 


177 


2 


DPHLHKKFKARCGGMTL*SQLLRRLR* K 
DQLSLGVGGCFELLWCYCTPAWVTERDP 
SC 


7271 


21172 


A 


7331 


413 


40 


RHMYLMFYTYIYVLDIYTYTVFFMYIYI 
P*CVCKYISCLHIMYRFISMYPCVYVYV 
CMC VC1WI HVI VENAGS CLAIj SALi IEME 
GMARDS KMWT FNP D E KMRLS YKKQ LE KL 
NFMILCPTWKEF 


7272 


21173 


A 


7332 


147 


49 


KNYIYFCRRGLPMLPRLGLNSRAQEILL 
¥ U P T iiLrijN b KAU -hL JL Jj-L P U P FM 


7273 


21174 


A 


7333 


148 


361 


S WLGT VAHAYNPS TLGGRGRQ IT* VQE F 
ETSQSNIARPLLYSVLAYAVQPPLPMVA 

MDVT MT TD r PC!'DTT'AT 

jyiKJ\.bW.b±P IbKl 1J\L) 


7274 


21175 


A 


7334 


305 


2 


CPGGFVTRVHR* YVRLSQPDLRVFTSGA 
MPASAAQSAGITGMSHRARPVCS 1 1 IFL 

JW±±f JK.V iC±r v±LLi 1 ± i/iMUWKUivl 1 i? LibSx V 
ASCRILASRPTPIWVL 


7275 


21176 


A 


7335 


1415 


1164 


FFLRQSLLiYCPGWSWAQSQLIASSITQ 

7V -A- 7\ T T t)DPaT*TriVO Q/"YLJT l r»TT T< r rPTrTr , /~*4-1~\r^ "C 

A* AJ. LjPFCoWJUx bbyHI Wx-ilxr r r L.*lJGr 
SMLPRL VS GS WAQMVLLPQL P KAQAKL 


1 L /O 


21177 


A 


7336 


1 


206 


r -1 TD 7\ TlXTT T\ZIQf~''E 1 /~\Tlf\T~)f" , f~'LI < PTTT'T C T T VT f~\ 

tj X K/UJiiijl v ]bvjr wJjy .rourrl 1 Jti 1 JjbJ_ililS.i.U 

RSQAWWHVPVMLPTR* I DARE S LERLMT 
IALLKVRNRYIF 


HOT? 
//// 


O 1 1 TO 


A 


(55 ( 


65 


302 


VC OT TJVPT IT" T "ETT'f 1 TTT-TVT TTV OT/" 1 T -Jr T T OCH 

i\.ob>iar x r -UrL-Lii IQjx* Jrix VAy IGJj^iai-ibbJJ 
DPPASVFQSAGITSMSHHTQPKELLKST 
RNTLEE PNREWAKAREWAHRRNT 


TT70 


Oil TO 




/55o 


208 


38 


GSLQPTPPGLK*FS*LSPRV 


7279 


21180 


A 


7339 


2 


198 


ARADHP RS GVRNH P GQHGET PVS T KNT K 
N*AWWRVPLVPECWDYILFYMFTNCMDN 


7280 


21181 


A 


7340 


2 


227 


ARGE IMS FAATWMQLEANILSKLTQE* K 
DYLGCS TQAVLKNLGTRAE 


7281 


21182 


A 


7341 


3 


300 


HEGRNYTLTGRDSCTIiAASAEKACGACP 
LWGKCDAE S S KCVCRE ASDCEEEG I S I C 
VDVNCKEQTMPEC * AVAL * CTVHL I YVT 
INNALCLPQLTLFL 


7282 


21183 


A 


7342 


210 


351 


RGLGTVAHACNLNTFGG*GRWII*AHEF 
E TSLRNI TKPSLRENPDML 



998 



